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Abstract 

It  is  well  known  that  when  the  identical  elements  of  a  planar  receive  array  are  laid  out  in 
horizontal  rows  and  vertical  columns,  a  fast  Fourier  transform  or  FFT  can  be  used  to  efficiently 
realize  simultaneous  beams  laid  out  in  rows  and  columns  in  the  direction  cosines  associated 
with  the  azimuth  and  elevation  directions.  Here  a  more  general  formulation  and  an  associated 
design  discipline  is  developed.  Identical  elements  are  laid  out  on  an  arbitrary  planar  lattice — it 
could  be  square,  rectangular,  diamond,  or  triangular  and  might  display  tremendous  symmetry 
or  very  little — and  the  beams  in  direction-cosine  space  are  laid  out  on  an  arbitrary  superlattice 
of  the  dual  of  the  element-layout  lattice. 

The  generality  of  these  two  arbitrary  lattices  can  yield  significant  cost  reductions  for  large, 
many-beam  arrays  and  arises  from,  first,  formulating  the  desired  beam  outputs  using  a  discrete 
Fourier  transform  or  DFT  generalized  to  use  an  integer-matrix  size  parameter  and,  second, 
efficiently  realizing  the  required  real-time  computations  with  the  generalized  FFT  based  on  a 
matrix  factorization  of  that  size  parameter  that  is  developed  here.  This  generalized  FFT  includes 
as  special  cases  the  usual  ID  and  2D  FFT’s  in  radix-2  and  mixed-radix  forms  but  offers  many 
more  possibilities  as  well.  The  approach  cannot  outperform  but  does  match,  when  the  matrix 
size  parameter  factors  well,  the  N  log  N  computational  efficiency  of  the  usual  FFT. 

Examples  illustrate  a  design  discipline  for  the  two  lattices  that  involves  jointly  determining 
element  spacing,  steering  range  and  beam-layout  geometry,  grating-lobe  behavior,  and  FFT 
factorability  and  therefore  computational  efficiency. 


‘This  work  was  supported  by  the  Defense  Advanced  Projects  Research  Agency  (DARPA)  Strategic  Technologies 
Office  (STO)  and  by  the  Naval  Research  Laboratory  base  program. 


1  Introduction 


Let  us  orient  ourselves  to  the  problem  by  recalling  a  few  basics.  The  discrete  Fourier  transform  or 
DFT  Xk  =  Enfo  Xn  e~i2nkn'N  transforms  a  function  xn  on  finite  time  domain  0, . . . ,  N—  1,  the 
set  of  all  possible  remainders  modulo  N,  to  a  function  on  a  finite  frequency  domain  comprising 
the  same  possible  remainders. 

The  fast  Fourier  transform  or  FFT  is  any  of  a  wide  variety  of  efficient  recursive  computational 
approaches  to  computing  the  DFT.  The  efficiency  results  from  the  number  N  of  remainders  in  each 
of  the  two  domains,  often  termed  the  number  of  DFT  or  FFT  points  or  the  size  of  the  DFT  or  FFT, 
being  highly  composite.  One  difference  between  the  DFT  and  FFT  then  is  that  while  one  might 
speak  of  a  DFT  even  when  N  is  prime,  for  example,  there  is  little  point  in  speaking  of  an  FFT  in 
that  case,  as  the  FFT  degenerates  then  to  an  as-written  DFT  computation. 

An  exponentially  modulated  filter  bank  IE  El  is  a  bank  of  filters  each  of  which  has  a  frequency 
response  that  is  a  frequency-shifted  version  of  some  prototype  response.  The  frequency  shifts  are 
spaced  uniformly  across  one  period  of  the  frequency  responses  using  the  complex  exponentials 
of  a  DFT,  typically  realized  with  an  FFT.  The  application  most  familiar  to  radar  engineers  is  a 
Doppler  filter  bank.  (The  difference  between  an  exponentially  modulated  filter  bank  and  a  cosine- 
modulated  filter  bank  is  the  difference  respectively  between  shifting  an  /  =  0  passband  in  the  two- 
sided  frequency  response  to  /  =  /o  and  shifting  it  instead  to  /  =  ±/o-)  Because  the  exponentially 
modulated  filter  bank  uses  frequency  responses  that  are  not  symmetric  about  /  =  0,  it  is  naturally 
associated  with  complex  signals. 

DFT  beamsteering  uses  an  exponentially  modulated  bank  of  filters  in  spatial  frequency — each 
spatial  frequency  of  interest  corresponds  to  a  plane-wave  direction  of  arrival  or  DOA — to  provide 
a  narrowband  digital  receive  array  with  many  simultaneous  beams  such  as  would  be  needed,  for 
example,  certain  radar  designs  10.  That  beamsteering  can  be  viewed  as  filtering  should  be  no  great 
surprise,  given  that,  as  is  widely  appreciated,  an  array  antenna  with  identical  elements  (surrounded 
by  “guard  elements”  as  necessary  to  ensure  identical  embedded  element  patterns)  spaced  uniformly 
along  a  line  has  an  array  factor,  the  contribution  to  the  array  pattern  of  the  array’s  geometry,  weight¬ 
ing,  and  phasing,  that  is  essentially  identical  in  its  mathematics  to  the  frequency  response  of  an  FIR 
digital  filter.  (This  parallel  is  explored  further  below.) 

Tradition  favors  referring  to  FFT  beamsteering  rather  than  DFT  beamsteering,  and  because  the 
FFT  is  generally  used  in  implementation  this  is  certainly  acceptable.  (Occasionally  DFT  beam¬ 
steering  is  referred  to  as  FFT  beam forming,  but  this  is  simply  incorrect.)  Here,  however,  the  FFT 
terminology  is  reserved  for  a  particular  implementation  strategy. 

DFT  beamsteering  is  not  a  new  idea  but  is  widely  appreciated  in  the  signal-processing  com¬ 
munity,  where  it  is  a  classic  application  of  the  exponentially  modulated  filter  bank.  Even  there, 
however,  misconceptions  are  common  regarding  what  an  DFT-beamsteering  system  can  and  cannot 
do. 

FALSE:  DFT  beamsteering  can  only  be  used  on  ID  arrays  and  on  2D  arrays  with  elements  laid 
out  on  a  rectangular  grid.  CORRECTION:  It  can  easily  be  used  on  the  triangular  grid  and 
in  fact  on  any  lattice  grid,  i.e.  one  comprising  integer- weight  linear  combinations  of  a  pair  of 
linearly  independent  array-plane  basis  vectors. 

FALSE:  The  use  of  DFT  beamsteering  precludes  the  use  of  tapered  array  factors,  and  one  must 
live  with  a  high-sidelobe  array  factor  resembling  a  sine  function.  CORRECTION:  The  choice 
of  the  prototype  array  factor  that  is  shifted  by  the  filter  bank  to  the  various  beam  positions 
is  not  restricted.  Its  design  and  its  steering  with  the  DFT  are  completely  separate  matters 
mathematically.  Neither  limits  the  other. 
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FALSE:  DFT  beamsteering  spaces  beams  uniformly  in  azimuth  and/or  elevation  angle.  CORREC¬ 
TION:  It  spaces  beams  uniformly  in  the  direction  cosines.  Said  another  way,  unit  vectors  in 
the  beam  pointing  directions  project  onto  the  array  plane  as  a  uniform  grid.  The  grid  need 
not  be  rectangular  with  rows  and  columns,  but  it  must  be  a  lattice  grid  related  to  the  element- 
location  lattice  grid  in  a  disciplined  way. 

FALSE:  The  number  of  points  in  the  DFT  must  be  equal  to  the  number  of  elements.  CORREC¬ 
TION:  The  array  factor  as  a  function  of  the  projection  of  the  DOA  unit  vector  on  the  array 
plane  (or  line) — this  is  a  normalized  array -plane  spatial  frequency — is  periodic  (in  a  general¬ 
ized  lattice-related  way),  and  the  number  of  points  in  the  DFT  is  the  number  of  beams  created 
per  period.  It  need  not  be  related  to  the  number  of  elements  in  the  array.  It  can  be  larger  or 
smaller. 

FALSE:  The  number  of  FFT  points,  and  hence  the  number  of  beams  per  period  of  spatial  frequency, 
must  be  a  power  of  two  or  a  power  of  two  in  each  direction.  CORRECTION:  For  FFT-style 
computational  efficiency  it  does  need  to  be  highly  composite  (in  a  specific  matrix  sense  to  be 
explored  later),  but  it  need  not  be  a  power  of  two. 

FALSE:  The  2D  FFT  that  steers  a  planar  array  in  both  directions  is  always  computed  as  many  ID 
row-wise  FFT’s  and  column-wise  FFT’s.  CORRECTION:  Sometimes  it  can  be.  But  more 
choices  of  beamsteering  grids  arc  available  if  this  restriction  is  abandoned. 

The  purposes  of  this  paper  then  are 

1.  to  derive,  in  Section  [2}  a  general  approach  to  beamsteering  using  a  DFT  with  a  matrix  size 
parameter  for  an  array  with  identical  elements  located  on  an  arbitrary  planar  lattice, 

2.  to  present,  in  Section  [3]  system-design  examples  of  the  use  of  this  DFT  beamsteering  ap¬ 
proach  that  represent  the  variety  of  useful  possibilities  in  enough  detail  to  enable  the  deter¬ 
mined  system  designer  to  address  her  own  design  needs,  and 

3.  to  present,  in  Section  |4]  an  generalized  FFT  algorithm  that  can  be  used  to  realize  this  beam¬ 
steering  system  by  efficiently  computing  the  matrix-size  DFT. 

This  FFT  with  a  matrix  size  parameter  appears  to  be  new  to  the  engineering  application  commu¬ 
nity,  though  mathematically  it  represents  only  a  specific  and  relatively  simple  case  from  the  well- 
established  and  much  more  general  area  of  fast  Fourier  transforms  on  groups. 

Readers  only  marginally  motivated  to  face  the  mathematics  are  encouraged  to  look  ahead  to  the 
concluding  remarks  of  Section [5] before  proceeding. 

Notational  conventions.  Matrices  and  vectors  are  bold,  while  scalars  are  not.  Dimensionless  unit 
vectors  wear  pointy  hats,  x,  because  all  they  do  is  point,  and  other  vectors  may  be  dimensioned  or 
not.  Matrices  are  uppercase;  vectors  are  lowercase.  Notation  |N|  with  N  a  square  matrix  will  mean 
|det(N)|.  Vectors  conveying  position  information  are  column  vectors,  while  vectors  conveying 
spatial-frequency  information  are  row  vectors — different  orientations  for  different  Fourier  domains. 
Variables  of  these  spatial  and  spatial-frequency  types  are  in  bold  Roman  fonts  with  the  letter  k 
favored  for  spatial  frequencies.  Orthogonal  projection  onto  the  array  plane,  or  indeed  simple  a 
priori  restriction  to  the  array  plane,  is  denoted  using  an  underscore  k  that  is  as  flat  as  that  plane,  so 
underlining  and  a  hat  together  k  denote  the  projection  into  the  array  plane  of  a  nonunderlined  unit 
vector  k.  Position  and  spatial-frequency  vectors,  whether  or  not  confined  to  the  array  plane,  are 
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three-vectors.  A  bold  italic  font  is  used  for  two-vectors,  and  the  usual  letters  are  favored  when  they 
arc  integer  valued:  n.  As  is  the  custom  in  mathematics  the  set  of  all  integers  is  denoted  by  Z  for 
“zahlen the  German  verb  “to  count.”  Modifying  the  usual  custom  here  with  vertical  and  horizontal 
lines,  the  universal  sets  of  two-vectors  of  integers  are  denoted  with 


7  2  A  r  n  i 
J  i  [n2 J 

l2  4  {[ni,n2] 


If  V  is  a  set  of  numbers  or  vectors  then 


:  n\  G  Z,  n2  E  Z} 
:  n\  G  Z,  n2  €  Z}. 


LV  =  {Ln :  v  £  V} 
VR  4  {r;R:  vGV} 

V  +  a  =  a  +  V  =  {a  +  r; :  n  £  V} 


so  that,  for  example,  2Z  and  1  +  2Z  are  the  sets  of  even  and  odd  integers  respectively. 

Summations  shown  without  explicit  ranges  for  their  indices  are  doubly  infinite  on  each  integer, 
so  if  n  =  \n\~\  for  example, 


oo  oo 

means 

n  ni=—oo  77.2— — oo 

Integral  signs  without  limits  denote  integration  from  —  oo  to  oo. 

Primes  are  not  transposes  or  derivatives  but  simply  naming  tags  to  distinguish  closely  related 
variables,  and  the  primed  variable  may  be  introduced  first  to  reserve  the  unprimed  one  for  an  im¬ 
portant  later  use. 

Technical  matters  referred  to  “the  Allerton  paper  ||4|”  arc  really  referred  to  0  and  to  a  much- 
improved  journal  version  together.  The  latter,  currently  under  preparation,  will  be  the  final,  author¬ 
itative  version,  and  0  should  be  considered  a  draft  with  occasional  errors. 


£  £ 


2  Multi-beam  steering  with  a  matrix-size  DFT 


This  discussion  is  divided  into  three  parts.  The  third  leans  heavily  on  the  first  two,  which  are 
independent  of  each  other. 

Section  |2. 1|  develops,  as  preliminary  mathematical  background,  the  idea  of  division  with  re¬ 
mainder  of  an  integer  vector  by  a  nonsingular  square  integer  matrix  and  the  related  idea  of  taking 
an  integer  vector  modulo  such  a  matrix.  This  background  discussion  is  of  significant  length  but  is 
unavoidable,  as  these  concepts  lie  at  the  heart  of  the  entire  paper. 

Section |T2| develops  from  first  principles  and  the  usual  array  factor  and  beam  sum  for  an  array 
with  identical  elements  laid  out  on  a  2D  lattice.  What  is  new  here  is  not  the  basic  result  but  the 
approach,  for  confidence  that  all  the  signs  and  such  are  correct,  and  the  specific  formulation,  for 
elegance  and  generality. 


Section  2.3  then  formulates  multi-beam  steering  using  a  DFT  with  a  matrix  size  parameter, 


which  approach  is  apparently  new. 

Choosing  the  grid  of  points  onto  which  beams  are  steered  in  normalized  spatial  frequency  (the 
array-plane  projection  of  the  DOA  unit  vector)  is  largely  deferred  to  Section  [3]  where  it  will  be 
explored  through  examples. 
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2.1  Remaindered  division  of  an  integer  vector  by  a  square  integer  matrix 
Engineer:  “You  can  do  thaffl.T 

Mathematician:  “Of  course  you  can  do  that. . .  you  mean  someone  cares?” 

In  anticipation  of  the  development  to  follow  of  DFT  beamsteering,  here  integer  remainders 
modulo- Ar  arc  generalized  to  integer-vector  remainders  modulo-N,  where  N  is  a  nonsingular  2x2 
integer  matrix.  This  author  has  seen  nothing  published  on  the  notions  presented  subsequently  on 
actually  computing  a  vector  modulo  a  matrix,  but  the  material  on  remaindered  division  discussed 
first  is  known  to  every  graduate  student  of  mathematics  using  somewhat  different  terminology,  give 
or  take  various  details,  and  is  a  straightforward  specialization  to  lattices  of  coset  decomposition,  one 
of  the  most  elementary  parts  of  group  theory  at  the  senior  undergraduate  level  (5} . 

The  simple  decomposition  in  which  we  are  interested  here  can  be  written  in  terms  of  either  row 
or  column  vectors.  Using  n",  n',  and  n  from  the  set  of  integer  column  two-vectors  and  k",  k\ 
and  k  from  the  set  S2  of  integer  row  two-vectors, 

n"=Nn'+n  (1) 

k"=k'N  +  k.  (2) 

Each  of  these  expresses  a  kind  of  division  by  matrix  N,  where  n"  and  k"  are  the  dividends,  N  is 
the  divisor,  n'  and  k'  are  the  quotients,  and  n  and  k  are  the  remainders. 

If  the  quantities  in  0  were  all  scalars  with  N  =  4,  for  example,  the  most  common  conven¬ 
tion  would  be  to  require  remainders  in  the  set  {0, 1,2,  3}.  Each  divisor  could  then  be  computed 
with  floating-point  division  followed  by  a  floor  operation,  and  the  rest  of  the  decomposition  would 
follow.  But  of  course  unambiguous  results  would  also  obtain  if  instead  we  required  remainders 
in  {—2,  —1,  0, 1},  which  decomposition  could  be  computed  by  obtaining  the  divisor  with  floating¬ 
point  division  followed  by  rounding  to  the  nearest  integer.  These  are  just  two  of  many  possible 
remainder  sets,  any  of  which  would  result  in  unique  decompositions  of  an  integer  in  terms  of  an 
integer  quotient  and  an  integer  remainder.  If  we  relax  the  “division”  interpretation  and  require  only 
that  the  decomposition  be  unique,  we  can  take  the  remainder  set  to  be  {1,2, 3, 4}  or  {—2, 1,4,  7}, 
for  example.  What  matters  is  that  there  are  four  remainders  in  the  set  of  allowed  possibilities  and 
that  no  two  of  them  differ  by  a  multiple  of  four,  because  it  is  these  conditions  that  make  the  decom¬ 
position  both  assuredly  possible  and  unique. 

We  will  ultimately  find  a  way  below  to  arrive  at  decompositions  ([T])  and  ([2])  with 

n  <E  [^2/N^2]  C  %2 
k  6  [%2/%2N]  C  S2, 

where  the  notation  [|i2/N|i2]  is  from  elementary  group  theory  and  in  the  present  context  denotes 
an  arbitrary  but  fixed  set  of  |N|  column  two- vectors  no  two  members  of  which  differ  by  a  vector 
of  form  Nn'  a  right  integer  column- vector  multiple  of  matrix  N.  Similarly,  [S2/^2 N]  denotes  an 
arbitrary  but  fixed  set  of  |N|  =  |  det(N)|  integer  row  two-vectors  no  two  members  of  which  differ 
by  a  vector  of  form  k' N,  a  left  integer  row-vector  multiple  of  matrix  N.  We  can  think  of  [|i2/N{i2] 
and  [S2/Z2N]  as  our  sets  respectively  of  allowable  column-  and  row-vector  remainders  modulo-N. 

We  can  see  how  the  defining  conditions  on  [^2/N^2]  guarantee  the  uniqueness  of  decomposi¬ 
tion  0  by  supposing  for  the  moment  that  uniqueness  does  not  always  hold  and  looking  at  what  is 
implied  when  it  does  not.  It  would  certainly  imply  the  existence,  for  at  least  one  choice  of  n",  of 
two  distinct  decompositions,  so  that  n"  =  Nnj  +  nj  =  N n'2  +  n 2,  from  which  it  would  in  turn 
follow  that 

N(nj  -  n'2)  =  ra2  -  ni.  (3) 
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But  if  it  were  so,  would  n 2  =  n\  or  not?  Matrix  N  is  nonsingular,  so  ([3])  says  that  ri2  =  ni  would 
be  possible  only  if  n\  =  n'2,  in  which  case  the  two  decompositions  would  not  be  distinct  after  all. 
But  ([3])  also  says  that  n<i  /  m  would  only  be  possible  if  the  two  remainders  differed  by  a  right 
integer  column-vector  multiple  of  matrix  N.  As  we  have  disallowed  this  possibility,  nonuniqueness 
would  imply  the  impossible,  that  neither  n 2  =  n \  nor  ri2  /  n\.  The  decomposition  is  therefore 
unique.  We  have  proved  it  “by  contradiction.” 

But  how  do  we  know  that  decomposition  ([[}  is  even  possible,  that  such  a  combination  of  an  n' 
and  an  n  exists  for  every  possible  n"l  Vector  n'  can  be  any  element  of  and  the  possible  values 
for  it  therefore  occur  with  unit  density  in  the  plane.  The  set  of  possible  values  for  Nn'  then,  which 
set  we  denote  N^2,  has  density  ^  in  the  plane.  For  each  of  these  possible  values,  the  sum  N n'+  n 
can  reach  |N|  possible  totals,  so  the  density  of  these  totals  in  the  plane  is  ^  x  | N  =  1  provided 
only  that  all  such  totals  arc  distinct,  a  requirement  we  know  is  met  by  the  uniqueness  argument 
above.  Therefore,  the  density  of  totals  in  the  plane  is  the  same  as  the  density  of  values  n/'that  we 
wish  to  express  as  totals.  There  are  just  enough  totals  to  go  around,  and  every  vector  n"  has  a 
decomposition  ([!}. 

Basically  then,  decomposition  ([T}  makes  n"the  sum  of  a  multiple  of  matrix  N  and  any  of  |N| 
possible  remainder  vectors.  It  is  similar  for  decomposition  ([2])  and  k". 

A  vector  modulo  a  matrix.  There  is  a  particular  choice  of  remainder  set  [T2/N!|i2] ,  call  it 
(columns},  ^lat  w'll  make  decomposition  n"  =  Nn'+  n  simple  and  easy  to  mechanize.  In  other 
words,  it  will  allow  us  to  straightforwardly  compute  a  value 

n  =  n"  mod  N  €  {“£?J 

that  will  result  in  n"  =  Nn'+  n  for  some  n'  £  %2.  The  approach  exactly  parallels  the  computation 
of  n  mod  4  as  4frac(n/4),  where  function  frac(.T)  =  x  —  returns  the  fractional  part  of  its 
argument. 

Begin  by  left  multiplying  relationship  n"  =  Nn'+  n  by  N~ 1  and  taking  the  fractional  part  of 
each  vector  componentwise  to  obtain 

frac(N_1n//)  =  frac(n/  +  N_1n)  =  frac(N-1n).  (4) 

On  the  right  we  can  further  observe  that 

frac(N_1n)  =  N_1n  (5) 

whenever  each  element  of  vector  N_1  n  is  nonnegative  but  less  than  unity,  that  is,  whenever  N  1  n  £ 
[j  =  [0, 1)  x  [0, 1),  the  unit  square  closed  on  two  of  its  four  sides.  Let  us  tentatively  take  as  set 
{columns}  those  values  of  n  for  which  the  latter  condition  holds  or 

(6) 

While  this  definition  will  be  useful  formally  below,  an  alternative,  equivalent  version 

{”£,}  =  {"  6  A”  6  NQ).  (7) 

will  be  more  useful  for  visualization  later. 

Before  using  definition  ([6]),  we  must  test  it  against  the  two  requirements  every  remainder  set 
[^2/N^2]  must  meet.  First,  let  us  determine  the  number  of  vectors  in  { c}}] unins }  as  defined  this  way. 
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Every  real  number  can  be  written  as  the  sum  of  its  integer  and  fractional  parts,  so  the  real  plane,  the 
set  of  real  column  two-vectors,  can  written  as  the  disjoint  union 

disjoint 

U  (»'+□)• 

n'e$2 

and  in  addition  we  can  certainly  intersect  the  real  plane  with  its  subset  N-1^2  without  changing  the 
latter,  so 


But  =  Nn'+  ^2,  so  on  the  right  above 

(N-1^2)  n  (n'+  D)  =  (N_1(Nn'+  %2))  n  (n'+  □)  =  (n'+  N”1^2)  n  (n'+  D) . 

The  set  on  the  right  has  the  same  size  as  (N-1^2)  n  Qi,  because  the  number  of  points  contained  in 
the  intersection  is  not  affected  by  identical  translation  of  its  two  components.  The  sets  in  the  disjoint 
union  in  (j8])  are  therefore  of  identical  size,  which  size  must  then  be  just  the  density  per  unit  area  of 
the  left  side  of  ([8])  or  |N|.  So  this  is  then  the  size  of  {^^ns}  =  (N-1^2)  (~l  |~j  as  well,  just  as  it 
should  be. 

Second,  suppose  n\ .  ri2  G  {™udm^s}  as  tentatively  defined,  and  suppose  n  |  —  n2  =  NT  for 
some  nonzero  £  G  |i2so  that  N_1roi  —  N”1ri2  =  £.  But  each  of  N-1ni  and  N_1ri2  is  in  Qi,  so 
the  only  difference  between  them  that  is  possible  that  is  integer- valued  in  both  components  is  zero: 
n  i  =  n^.  We  have  proved  by  contradiction  that  no  two  elements  of  differ  by  a  right 

integer  column-vector  multiple  of  matrix  N. 

We  can  therefore  formally  adopt  (|6])  as  the  definition  of  [ip2/ Nip2] ,  as  (|6j)  meets  both  require¬ 
ments,  and  accomplish  decomposition  n"  =  Nn'  +  n  with  ri  G  |i2  and  n  G  {  by  noting 

that  ([4}  and  ([5])  now  give  N_1n  =  frac(N_1n)  =  frac(N_1n//),  from  which  it  follows  that 
n  =  n"  mod  N,  where 

n"  mod  N  =  N  frac(N~1n//).  (9) 


Geometric  interpretation  of  the  mod  N  remainder  sets.  Suppose,  for  example,  that 


N  = 


The  determination  of  (resp.  {mr^})  according  to  definition  ([7])  (resp.  definition  (p~2]> 

below)  are  shown  on  the  left  (resp.  right)  in  Figure  [I]  The  components  of  that  figure  are  summarized 


in  the  center  (resp.  right)  column  of  this  table. 
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Figure  1 :  Left  and  right  plot  respectively: 
for  N  =  [„?  i]  determine  remainder 
sets 

CSJ  =  t  [8]  .[-J]} 

Wwf}={[»  0],[0-1]}. 

graphically  using  <(7]»  and  ([12]). 


left 

right 

dots 

%2 

Z2 

circled  dots 

N$2 

z2n 

transformed 
unit  square 

ND 

□  N 

in  the  semi-open 
parallelogram 

r  mod-N  l 
l  columns/ 

f  mod-N  1 
l  rows  J 

Dots  depict  (resp.  Z2),  and  circled  dots  depict  its  subset  Nzf2  (resp.  Z2N),  which  comprises 
the  sums  of  the  integer-weighted  columns  (resp.  rows)  of  N.  If  %2  is  replaced  in  N|Z  (resp. 
Z2  is  replaced  in  Z2N)  with  the  set  {[o])[i]’[o]>[i]}  °f  t^ie  f°ur  co'umn  vectors  (resp.  set 
{[0  0],  [1  0],  [0  1],  [1  1]}  of  row  vectors),  the  result  is  the  four  corners  of  transformed  unit  square 
N  [j  (resp.  □  N)  in  definition  Q  (resp.  definition  ([12])  below).  If  that  parallelogram  boundary  is 
taken  as  closed  at  the  origin  and  on  the  edges  adjacent  to  the  origin  and  open  on  other  edges  and 
corners,  then  the  points  of  (resp.  Z2)  that  the  parallelogram  contains  are  precisely  the  vectors  in 

GS,}  =  {[8M-;]}  (resp.{miN}  =  {[«  »],[«-']})■  UO) 

The  parallelogram  tiles  the  plane  when  translated  by  elements  of  Nz(r  (resp.  Z2N)  and  so  can  be 
taken  as  a  period  of  the  latter  subset  of  the  plane.  When  NzjZ  (resp.  Z2N)  is  considered  instead  as  a 
subset  of  %2  (resp.  Z2),  the  remainder  set  {JXmins  }  (resP-  })  can  be  taken  as  its  period.  This 

corresponds  exactly  to  the  universality  and  uniqueness  of  decomposition  0  (resp.  Q).  In  Figure [T] 
the  density  of  in  the  plane  is  twice  that  of  its  subset  N|i2,  so  a  period  of  this  N^2  must  contain 
exactly  two  points  of  %2  and  the  size  of  the  remainder  set  is  determined.  In  general  there  are  an 
infinite  number  of  choices  for  a  plane-tiling  period,  not  necessarily  involving  parallelograms  or  even 
a  continuous  region,  and  these  choices  exactly  yield  the  [Z2/N|(2]  (resp.  [Z2/Z2N] )  alternatives  to 
{columns}  (resP-  {'Tows^  Here  any  combination  of  one  (uncircled)  dot  and  one  circled  dot  would 
be  allowable  as  a  remainder  set  [^2/N^2]  (resp.  [Z2/Z2N]). 

Computing  mod  N  safely.  The  graphical  procedure  just  illustrated  is  adequate  for  simple  exam¬ 
ples,  either  to  determine  the  entire  remainder  set  or  to  test  a  particular  vector  for  membership  in 
that  set.  (Is  it  in  the  parallelogram?)  But  points  can,  as  is  the  case  in  Figure  [TJ  lie  exactly  on  the 
open  boundaries  of  the  parallelogram.  This  invites  problems  with  computational  noise  when  these 
procedures  are  automated.  Can  this  issue  be  reliably  sidestepped  somehow  in  practice? 

Yes,  we  can  in  fact  easily  make  the  matlab  computation  of  ([9])  fast  and  robust  and  in  no  danger 
of  suffering  edge  effects  in  the  frac  operation  or  from  numerical  noise  in  the  inverse.  The  frac  oper¬ 
ation  in  frac(N_ 1  n")  can  be  viewed  as  a  componentwise  modulo-one  operation  on  real  numbers, 
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and  we  can  scale  that  operation’s  arguments  up  by  |N|  and  its  output  down  by  |N|  to  compute  the 
frac  operation  as 

frac(N”1n//)  =  ^|N|  N-1™"  mod  |N 

using  a  componentwise  mod  operation.  Product  |N|  N_1  contains  integers  only  and  can  be  rounded 
to  eliminate  numerical  noise  and  thereby  guarantee  that  the  mod  operation  has  a  left  argument  that 
is  numerically  an  integer  vector.  Using  npp  for  n" in  matlab  then,  we  can  describe  the  computation 
n  =  n"  mod  N  using  ([9])  with  the  code 

function  n=columnmod (npp, N) 
adN=abs (det (N)  )  ; 

n= (N*mod (round (adN*inv (N) ) *npp, adN) ) /adN; 

The  final,  integer  division  is  remainderless  because  each  component  of  the  numerator  is  a  multiple  of 
the  denominator.  To  see  this,  note  that  the  mod  operation  on  the  last  line  has  the  effect  of  offsetting 
its  left  argument  by  some  vector  c|N|  where  c  G  %2 .  The  numerator  is  therefore 

N  (|N|N-1n"  +  c|N|)  =  (n"+Nc)|N|.  (11) 

There  are  analogous  definitions  and  computations  for  row  vectors  to  be  decomposed  according 
to  (|2]).  In  this  case,  and  for  convenience  now  taking  Q  to  comprise  row  vectors, 

{Tot*}  4  {k  G  £2 :  k  G  □  N}  (12) 

k  =  k"  mod  N 
k"  mod  N  4  frac(fc//N_1)N 

with  matlab  definition 

function  k=rowmod (kpp, N) 
adN=abs (det (N)  )  ; 

k= (mod (kpp*round (adN*inv (N) ) , adN) *N) /adN; 

When  N  is  just  a  positive  scalar  integer  N,  both  columnmod  ( )  and  rowmod  ( )  degenerate 
correctly  to  the  usual  mod-W  operation. 

Other  remainder-set  choices.  Let  remainder  set  [jt2/Nzf2]  once  again  be  arbitrary.  For  any 
n"  G  there  is  a  unique  decomposition,  according  to  ([!]),  of  the  form 

rt"=Nn'+n,  (13) 

where  n'G  and  n  G  [^2/N^2] .  But  how  do  we  effect  this  decomposition  for  a  given  n"l 
We  already  know  there  is  also  a  unique  decomposition  of  the  form 

n"=  N  n'moi  +  nmod,  (14) 

where  nmod  =  n"  mod  N  and  where  n'iiod  G  can  then  be  solved  for.  Subtracting  (fl3j)  from  (I4|) 
and  solving  for  n  yields 

n  =  Nnoffset+  nmod,  (15) 

where  n'o{fset  =  n'mod  -n'Gf2. 


Relationship  (fl5|)  lets  each  of  n  and  nmod  be  determined  unambiguously  from  the  other.  The 
simple  case  is  when  n  is  known  first,  because  the  uniqueness  of  decomposition  mod  N  then  imme¬ 
diately  yields  the  second  equality  in 

Timod  =  n" mod  N  =  n  mod  N.  (16) 

When  nmod  is  known  first  in  (fT5])  instead,  n  is  known  as  well,  in  principle,  because  if  we  had  two 
possibilities 

Tl\  —  N '"-offset I  T  Wnod 
n2  =  N7T.oljset2  +  nmo(j, 

we’d  have  n\  —  n2  =  N(n[)ffsetl  —  n[)ffsel2),  and  ni  and  n2,  both  elements  of  [^2/N^2],  would 
differ  by  a  right  integer  column-vector  multiple  of  matrix  N,  which  is  disallowed. 

Since  the  map  [^2/N^2]  :  nmod  nTfL,N  defined  by  (|T6j)  is  therefore  one-to-one,  and 

since  the  range  {c™°,fmns}  and  the  image  {  n  mod  N  :  n  G  [^2/N^2]  }  of  the  domain  under  that 
map  have  the  same  finite  size,  the  map  is  onto  as  well  and 

(coUnd  =  {»  “Od  N:  n  €  [?2/N  f2] }.  (17) 

Obtaining  { coIlutths }  in  UZt*  from  [?2/N?2]  in  this  way  is  straightforward,  but  going  in  the 
other  direction  is  possible  as  well.  Because  n  in  (fT5|)  is  uniquely  determined  by  nmo d,  we  can 
determine  it  in  practice  by  a  lookup  table.  Equivalently,  the  lookup  table  could  provide  the  difference 
n  -  nmod  =  Nn'ffset  of  ([15])  or  just  n')fTsct.  Expressing  the  latter  as  n')flset  =  npof  f  set(nmod) 
gives,  using  (p~5|),  the  matlab  computation  of  n  in  (fT3T)  from  n",  or  npp  in  matlab,  as 

nmod=colummod (npp,  N)  ; 
n  =  N*npoffset (nmod)  +  nmod; 


using  the  vector  columnmod  ( )  function  defined  previously  and  predefined  lookup-table  array 
npof  f  set  ( ) .  Once  n  is  thus  determined,  n' can  be  solved  for  to  complete  decomposition  (T3|). 
Having  shown  implicitly  that  an  arbitrary  remainder  set  [Tr/N^2]  can  be  expressed  in  the  form 

[^2/N$2]  =  {N  npof  f  set(nmod)  +  nmod:  nmod  G  {“£}},  (18) 


we  can  easily  verify  that  the  converse  is  true  also,  that  a  set  of  the  form  on  the  right  in  ( 18 1  satisfies 


the  defining  requirements  for  a  remainder  set  [|)2/N^2] ,  no  matter  what  elements  of  |)2  are  returned 
by  npof  f  set  ( ) .  First,  the  set  on  the  right  in  (fl8|)  has  |N|  elements,  because  does  and 

because  adding  the  N  npof  f  set(nmod)  term  to  nmod  in  (p~8|)  cannot  land  the  sum  on  another 
element  of  },  as  elements  of  the  latter  cannot  differ  by  such  a  term.  Second,  the  difference 

of  two  elements  of  the  set  on  the  right  in  (18])  takes  the  form  (N  npof  f  set(nmodl)  +  nmodi)  — 


(N  npoffset(nmod2)+nmod2)  =  N(npof  f  set(nmodi)-npof  f  set(nmodi))+nmodi-nraod2 


and  so  can’t  be  a  right  integer  column-vector  multiple  of  matrix  N  because  nmodi  —  nmod2  can’t. 
The  possible  sets  [^2/N^2J  are  therefore  exactly  those  of  form  (18])  for  the  various  possible  choices 
of  map  npof  f  set  :  {”£s}  ?2 

For  row  vectors  matters  are  analogous.  We  have 


[S2/S2N]  =  {klookup(fcmod)  N  +  kmod:  kmod  G  {mr°0tN}}  (19) 


with  the  corresponding  matlab  for  individual  values: 


kmod=rowmod (kpp, N)  ; 
k  =  klookup (kmod) *N  +  kmod; 

In  the  other  direction, 

{rn^-N}  =  {k  mod  N:  k  G  [S2/^2N]}. 
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2.2  The  2D  lattice  array  from  first  principles 

This  development  applies  to  a  narrowband  array,  in  which  signals  arc  at  (near)  Fourier  radian  fre¬ 
quencies  ±27tc/A,  with  A  the  positive  scalar  wavelength  of  interest. 


A  hypothetical  one-port  array.  Let’s  do  a  thought  experiment.  Suppose  for  the  moment  that  our 
array  has  but  one  output  port,  at  the  spatial  origin  and  from  which  we  take  our  output.  Of  course 
physically  it  has  other  ports,  but  we  can  suppose  them  terminated  to  take  them  out  of  action.  In 
fact  we  can  suppose  our  one  port  to  be  terminated  as  well,  but  we  will  suppose  we  arc  looking  at 
the  voltage  across  the  termination  resistance  and  taking  that  as  our  signal.  In  other  words,  we  arc 
looking  at  the  output  of  a  single  element,  embedded  in  the  array. 

Now  suppose  we  translate  the  entire  array  in  position,  without  changing  its  orientation  in  any 
way,  so  as  to  move  the  one  output  port  from  the  origin  to  position  x,  a  column  vector,  and  let  us  then 
call  that  translated  port’s  output  signal  s(x,  t).  We  can  write  this  in  terms  of  its  spacetime  Fourier 
transform  S(k,  /),  using  row  vector  k  as  the  spatial  transform  variable,  as 

s(x,f)  =  J  HI  S(K,f)e^t+^dKdf  (20) 

where  dK  is  differential  spatial-frequency  volume. 

How  do  we  interpret  Fourier  transform  S(k,  /)?  Output  signal  s(x,  t)  is  necessarily  real,  and 
it  follows  that  S(k,  f)  =  S*(— k,  —  /).  Assuming  then  that  S(k,  0)  =  0  (no  DC  output  to  confuse 
region-of-integration  endpoint  matters  below),  we  can  show,  through  an  annoying  number  of  small 
steps  beginning  with  splitting  the  integral  in  /,  that 


'  — OO 
POO 


s(x,t)=l  /US (K,  /)  ej25T(/t+Kx)  dKdf  + 

JO 

S(-k,  -/)  e~^ft+Kx)  dK  df  +  f 

Jo 

s*{Kj)e-i2^ft+Kx)dKdf  +  r 

Jo 


S{k,  f)  dKdf 

5(K,/)ej27r(/t+Kx)  dK  df 
S(K,/)ejW+Kx)  dK  df 


K,f)eyMft+^}dKdf 

_ .  f  \  j  ft  Kwavenumberx)  \  .if 

"'wavenumber  i  ./  )  '  J  “"-wavenumber  aJ 


This  final  form  looks  exactly  like  the  integral  traditionally  used  to  combine  complex  field  compo¬ 
nents,  each  a  propagating  plane  wave,  to  form  the  real  field.  If  our  one-port  array  were  a  perfect 
polarized  field-measuring  device,  measuring  perhaps  the  field  along  some  one  vector  direction,  then 
that  interpretation  would  apply  directly.  Even  though  it  is  not  really  such  a  device,  it  does  seem  that 
we  arc  integrating  over  frequency  and  wavenumber  to  combine  some  sort  of  propagating  complex 
waves,  with  the  wave  at  wavenumber  vector  —  k  and  frequency  /  >  0  having  complex  amplitude 
2  S(K,f). 

It  is  not  necessary  to  go  into  the  mathematics  here,  but  in  fact  the  difference  between  the  inte¬ 
gral  in  our  one -port  thought  experiment  and  the  traditional  field  integral  is  that  in  our  integral  the 
complex  amplitude  function  2  S(k,  f)  governing  the  amplitudes  and  phases  of  the  waves  has  been 
preprocessed  behind  the  scenes  with  the  embedded  element  pattern.  Because  this  element  prepro¬ 
cessing  is  lineal-  and  both  space  and  time  invariant,  the  Helmholtz  relation  governing  free-space 
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electromagnetic  propagation  will  apply  after  element  processing  just  as  it  does  before,  and  trans¬ 
form  variable  re  will  have  length  | j  re  1 1  =  1/A  just  as  a  wavenumber  vector  would.  The  difference  is 
that  here  re  points,  for  components  having  /  >  0,  not  in  the  direction  of  propagation  but  in  the  DOA. 


We  will  call  re  in  (20 1  the  DOA  wave  vector. 


By  the  nature  of  its  derivation  above  transform  5(re, /)  for  /  >  0  represents  the  element- 
preprocessed  incoming  plane-wave  signal  at  frequency  /  and  wavenumber  —re.  We  next  derive 
the  actual  array  output.  Comparing  the  latter  with  this  one -port  array  output  will  give  us  the  array 
factor  we  seek. 

The  array  factor  and  beam  sum.  If  we  fix  x  =  0  in  ([20])  the  integral  gives  us  the  untranslated 
embedded  element  output.  Let  us  take  that  as  a  reference  signal  to  be  improved  upon  through  the 
creation  somehow  of  a  signal 


X(\Ks,t)  = 


Gideai(K  -  ks)  S(k,  f)  ei27Tft  dK  df. 


(21) 


This  is  just  integral  ([20])  with  x  =  0  and  with  S(k,  /)  replaced  with  Gideai(^  —  ks)S(k,  /).  A 
convenient  normalization  has  been  adopted  for  the  function’s  first  argument,  which  is  really  just  a 
parameter. 

A  quantity  serving  the  role  that  G ideal  ( k,  —  re,s)  does  in  ([2d])  is  traditionally  termed  an  array 
factor.  We  can  suppose  that  the  res  =  0  or  unsteered  array  factor  6)^1  (re)  is  approximately  unity 
when  DOA  wave  vector  re  points  in  some  boresight  direction  and  is  near  zero  when  it  points  well 
away  from  that  direction  so  that — this  is  how  we  express  it — array  factor  Gideai(^)  has  a  boresight 
beam.  In  our  modified  combining  integral  Gideai(^)  actually  appears  translated  by  res,  so  there  the 
beam  occurs  for  re  «  res  instead  of  for  re  ~  0. 

How  do  we  create  a  signal  that  will  be  described  by  such  a  modified  combining  integral?  What 
we  want  is  4D  Fourier-pair  relationship 

X(Xks,  t )  =  (mystery  function  of  x  and  f)  |  , 

(mystery  function  of  x  and  t)  <->  Gjdca|(re  —  res)5(re,  /), 

because  sampling  at  the  origin  in  one  domain  is  equivalent  to  integration  in  the  other.  The  convolu¬ 
tion  property  of  the  transform  and  ID  Fourier  pair  5(f)  <->  1  then  yield 

(mystery  function  of  x  and  f)  =  (5(f)  (mystery  function  of  x))  *  s  (x,t) 

Gideal(^  /)? 

where  (mystery  function  of  x)  <->  Gjdeai(^  —  res).  We  can  proceed  further  along  these  lines  by  using 
the  modulation  property  of  the  transform  to  write 


(mystery  function  of  x)  =  e 


_  j271-ksx 


9ideal(^)  ^  G/deal^ 


with  5ideai(^)  Cjdeai(K)-  Next,  since  <7ideai(x)  is  a  spatial  impulse  response  we  are  convolving 
with  the  input  signal  in  a  sort  of  spatial  filtering  operation,  let’s  specialize  to  the  simplest  kind  of 
filter  to  design,  a  finite-impulse-response  or  FIR  filter:  we  require  that  <7ideai(x)  comprise  a  finite 
number  of  impulses.  For  convenience  we  will  make  the  number  formally  infinite  but  with  all  but  a 
finite  number  of  their  areas  {gn"}  nonzero.  Then  we  can  write 


t/idcal ( ^  ^  (  9n"  5(x  >  Gjdeal(^)' 


(22) 
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As  far  as  this  development  is  concerned,  those  weights  { gn "}  that  are  nonzero  might  all  be  unity,  as 
for  an  untapered  beam,  or  they  might  be  strictly  positive,  or  not,  or  they  might  even  be  complex.  It 
is  a  system-design  issue  unrelated  to  the  steering  matters  at  hand. 

So  far  then  we  have 

(mystery  function  of  x  and  t)  =  ( 5(t )  (mystery  function  of  x))  *  s  Cm) 

=  (5(f)  ej27rKsX  gideal(K))  *  s(x,  t) 

=  (5(f)  ej2™sX  ^2  gn»  5(x  -  xn„))  *  s(x,  f) 

n" 

=  (5(f)  ^2  ej27rKsX""  gn"  5(x  -  x„»))  *  s(x,  f) 

n" 

=  ^ej2™sX""  gn»  s(x  -  xn//,f), 

n" 

which  when  evaluated  at  x  =  0  becomes  our  desired  array  output 

X(\Ka,t)  =  ^ej27rKsX""  gn»  s(-x„»,f).  (23) 

n" 

An  array  output  is  termed  a  beam  or  beam  sum  in  hopes  of  a  particular  functionality. 

The  array  structure  must  be  periodic  in  a  lattice  sense.  The  sum  ([23])  depends  on  the  output  of 
our  one -port  array  when  the  array  is  translated  by  each  — xn//,  in  effect  requiring  us  to  have  access  to 
many  different  translated  arrays  simultaneously.  This  actually  requires  only  that  the  original  array 
and  the  array  translated  by  an  arbitrary  one  of  the  —  xn»  are  identical.  Then  since  the  array  has  an 
output  port  at  the  origin,  it  must  have  one  at  — xn//  as  well.  Both  arrays,  untranslated  and  translated, 
exist  simultaneously  for  our  purposes  if  we  observe  the  signals  across  both  terminations.  Taking  it 
further,  the  array  must  be  physically  unchanged  by  translations  from  set  X  =  {— x„»  :  n"  G  zp2  } 
and  must  have  a  port  at  each  element  of  X. 

Now  suppose  we  translate  the  array  twice  in  succession,  first  by  one  vector  from  X  and  then 
by  another.  Since  neither  translation  changed  the  array,  and  since  the  net  translation  is  the  sum  of 
the  two  translations,  we  see  that  if  the  array  is  unchanged  by  translations  from  X,  then  it  is  in  fact 
unchanged  by  translations  from  the  algebraic  closure  of  X  under  vector  addition.  We  may  as  well 
start  off  by  requiring  X  to  be  closed  under  vector  addition,  and  we  can  just  set  gnn  =  0  for  any 
member  of  X  not  actually  needed.  Since  there  is  an  output  port  at  each  point  in  X,  and  since  a 
port  occupies  some  physical  space,  the  set  X  must  be  discrete,  that  is,  each  point  in  it  must  be  at 
least  some  positive  minimum  distance  from  all  other  points.  We  can  assume  that  set  X  contains  the 
origin,  because  we  know  there  is  a  port  there.  If  translating  by  an  element  of  X  did  not  change  the 
array,  then  neither  does  translating  by  the  negative  of  that  element  to  get  back  to  where  we  started, 
so  X  is  closed  under  negation.  A  discrete  subset  of  real  Euclidean  space  that  contains  the  origin  and 
is  closed  under  negation  and  summation  of  its  vectors  is  a  lattice  |[6j|. 

Aside:  In  this  paper  a  grid  is  just  a  less  formal  name  for  a  lattice. 

That  this  array  is  unchanged  by  translation  by  elements  of  a  lattice  means  it  is  periodic  in 
structure  and  has  infinite  extent,  but  is  it  infinite  in  one,  two,  or  three  dimensions?  We  will,  of 
course,  extend  the  physical  structure  only  some  maximum  distance  out  from  the  origin  and  just 
use  “guard  elements”  around  the  boundary  to  fool  the  elements  whose  outputs  we  are  using  into 
behaving  electrically  as  if  they  are  in  an  infinite  array.  But  even  with  this  bounding  of  the  array 
extent,  the  3D  version  appears  impractical,  because  given  a  plane  wave  propagating  through  the 
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structure,  the  power  out  of  every  port  would  approach  zero  because  of  shadowing  “upstream”  by  an 
infinite  number  of  other  ports,  each  of  which  would  have  a  termination  consuming  power  from  the 
wave.  The  most  general  structure  available  to  us  then  is  a  2D  lattice  of  elements.  The  discussion 
below  will  assume  such  a  structure  and  specialize  the  notation  to  it,  though  redoing  it  all  for  the 
uniformly  spaced  line  array,  which  uses  a  ID  lattice,  would  not  be  difficult. 


The  element-location  lattice.  Let  bi  and  b2,  each  a  three-element  column  vector,  be  linearly 
independent,  dimensionless  basis  vectors,  and  then  form  from  them  element-location  basis  matrix 
B  =  [bi,  b2]  -  The  basis  terminology  is  justified  because  the  smallest  lattice  including  these  two 
vectors,  in  other  words  the  smallest  set  including  these  two  vectors  that  is  algebraically  closed  under 
negation  and  vector  addition,  is  the  set 


B^1 2, 


the  integer-weighted  sums  of  the  basis  vectors.  We  can  scale  this  lattice  by  A  to  give  its  elements 
units  of  lcngtlj^jand  then  set  location  xn//  =  ABn".  For  notational  economy  let  us  further  define 
element  output  s  n ,  indexed  by  the  two  integers  of  element  index  n"  e  to  be  the  signal  taken 
from  the  output  port  at  nominal  element  location  —  xn»  =  — ABrF'so  that  s  »  =  s(— xn»,  t).  Our 
signals  are  functions  of  time  but  for  economy  of  notation  we  will  leave  the  time  dependence  of  sn" 
implicit  as  time  does  not  enter  into  the  beamsteering  discussion  below. 

The  element  locations  are  all  contained  in  the  linear  span  of  {bi,b2},  which  span  is  there¬ 
fore  termed  the  array  plane.  We  term  BZ2  and  AB^2  the  normalized  and  unnormalized  element- 
location  lattices  respectively. 


Normalized  variables  for  the  lattice  beam  sum  and  array  factor.  We  can  use  (|22])  to  obtain  the 
unsteered  array  factor  Gideai(K)  governing  the  integral  combination  in  (|2T|).  Using  <±x  for  differential 
spatial  volume, 


Gideal^)  — 


in  terms  of  a  function 


n" 

J>n"e-j2™ABn 


gnn  5(x  —  ABn")  )e  j27rrexcpc 


n 


// 


=  G(kAB) 


G(/)  = 


A  \  “ 


q  n  e 

3  n. 


-j27 vfn" 


(24) 


where  /  is  a  2D  row-vector  normalized  frequency.  We  recognize  (|24|)  as  the  discrete-time  Fourier 
transform  in  two  dimensions  of  weights  gnii .  Since  the  number  of  nonzero  weights  is  finite,  it  is 
also  the  frequency  response  of  an  FIR  filter  with  weights  gn".  Steered  array  factor  Gideai(^  —  ks) 
now  specializes  to  G((k  —  ks)AB). 

Flelmholtz  relation  |  k  |  =  1/A  makes  normalized  DOA  wave  vector  n\  a  unit  vector,  and  it 
is  natural  to  make  normalized  DOA  steering  vector  /tsAa  unit  vector  as  well  so  that  the  beam  of 
G((k  —  ks)AB)  points  in  some  physical  direction  ksX.  Let  us  create  notation  for  these  unit  vector^ 


1  In  the  Allerton  paper  0  the  basis  itself  has  length  units:  the  quantity  AB  of  this  paper  is  the  Allerton  paper's  Bc, 
the  computational  representation  of  the  element-location  basis  B  that  plays  such  a  central  role  in  the  development  there. 

2Expression  —  k  here  is  the  k  of  the  Allerton  paper  0. 

A 
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and  further  decompose  them  into  components  in  and  normal  to  the  array  plane: 


Xk  =  K 


tk  T  n  normal 


Aks  =  k"  = 


k"+  k 


// 

normal  * 


Our  steered  array  factor  G((k  —  ks)AB)  now  becomes  C ( ( k  —  k")B),  in  which  the  normal  com¬ 
ponents  were  lost  in  the  product  with  B,  the  columns  of  which  are  in  the  array  plane.  This  array 
factor  is  often  plotted  against  the  direction  cosines,  the  components  of  normalized  array-plane  DOA 
wave  vector  k  in  orthogonal  “azimuth”  and  “elevation”  directions  in  the  array  plane.  (Precisely  for 
convenience  in  plotting,  notation  for  those  components  will  be  developed  explicitly  in  Section[3]) 
Beam  sum  (|23|)  can  be  written  now  in  terms  of  element-location  basis  matrix  B  and  normalized 
array-plane  DOA  steering  vector  k"  as 

X(\KS)  =YJ^~'Bn"  9n"  Sn"  =  X(k").  (25) 

n" 

The  normal  component  of  the  steering  vector  has  been  dropped  on  the  right  as  irrelevant. 


A  delay  interpretation. 

steered  weight  ej27rk"Bn 
delay 


We  can  get  a  check  on  the  formulation  of  the  beam  sum  by  rewriting 
"gnii  in  ([25])  as  e_j27rkr(k"X') gn„,  where  frequency  /  =  c/A  and  where 


r(k",n")  A  -  Ak"Bn"/c  =  -  (-k")(-ABn")/c, 


a  dot  product  with  a  unit  vector  that  reveals  the  delay  to  be  an  advance  (negative  delay,  using  the 
first  minus  sign  on  the  right)  of  the  signal  from  the  element  at  position  -ABn"  by  the  free-space 
propagation  time  across  the  component  of  that  position  along  wave-propagation  direction  —  k.  This 
advancing  in  time  counters  the  time-of-flight  delay  for  the  wave  to  reach  element  n"  relative  to 
when  it  reaches  the  spatial  origin. 


2.3  Many  Beams  from  the  DFT 

The  steering  lattice.  What  makes  steering  with  a  DFT  possible  is  also  the  main  limitation  of 
the  approach:  the  family  of  allowable  normalized  array- plane  DOA  steering  vectors  k"must  be 
chosen  in  a  particular  way.  Let  us  begin  by  attempting  to  choose  it  so  as  to  undo  or  cancel  the 
effect  that  element-location  basis  matrix  B  has  on  steering  exponential  ej27rk"Bn".  One  ordinarily 
cancels  a  matrix  with  its  inverse,  but  of  course  we  cannot  invert  3x2  matrix  B.  However,  since  the 
basis  vectors  in  B  are  linearly  independent,  Gram  matrix  B7  B  is  assuredly  invertible,  and  we  do 
therefore  have  use  of  the  Moore-Penrose  pseudoinverse  B  that  is  computed  by  matlab’s  pinv  ( ) 
function  and  defined  by 

B+  A  (BTB)-1BT,  (26) 

The  usefulness  of  the  pseudoinverse  (here)  is  in  the  property  B+B  =  (B7  B)_1B7  B  =  I.  So  let 
us  set  steering  vector 

k"  =  —  fc//N_1B+,  (27) 

where  N  is  a  nonsingular  2x2  integer  matrix,  the  design  of  which  will  be  explored  in  Section  [3] 
The  quadratic  form  in  steering  exponential  ej27rk"Bn"now  becomes 

k"Bn"=  -fc"N-1(BrB)-1BrBn" 

=  —  fc"N_1n", 
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and  we  have  not  only  eliminated  B,  we  have  replaced  it  with  —  N  .  (The  minus  sign  will  prove  to 
be  a  minor  convenience.) 


What  makes  this  interesting  is  the  Section  2.1  material  on  remaindered  division  by  an  integer 
matrix,  because  substituting  decomposition  Q  into  (27 )  results  in 


k"B  n"=  ~{k'  N  +  fc)N_1n" 
=  —k'n"—  fcN_1n", 


which  makes  the  steering  exponential  itself  just 


j27rk"Bn" }2irk'n" j27rfeN  1n" p—  j27rfcN 


=  e 


(28) 


The  magic  here  is  that  all  periodicity  has  disappeared  with  the  factor  ei27Tk'n'\  because  any  two 
choices  for  k  E  [Z2/Z2N],  say  k\  and  &2,  result  in  e-J27rfciN  ln"  =  e-j27rfc2N  ln"  ^  anc[  oniy  jf 
fciN_1n//—  A^N1  n"  is  an  integer,  and  it  is  an  integer  for  all  n"  E  %1  if  and  only  if  fc|N  1  — 
&2 N”1  E  Z2.  But  if  fciN-1  —  =  Aioffset  F  Z2,  then  k\  —  &2  =  ^offsetN,  which  form  is 

disallowed  unless  k\  =  &2-  So  when  decomposition  ([2])  is  used  for  k"  in  steering- vector  choice 
([27]),  we  can  think  of  k! as  selecting  which  steering  period  we  are  in  and  k  as  selecting  a  point  within 
that  period.  We  have  |N|  steering- vector  choices  within  the  steering  period  through  our  choice  of 
k  E  [Z2/Z2  N] ,  so  in  this  context  let  us  term  N  the  steering  density  matrix. 

A  very  special  case  is  when  steering-vector  choice  (|27|)  places  k"  in  Z2N,  a  subset  (sublattice 
in  fact)  of  Z2.  In  that  case  (|27|)  places  normalized  array-plane  DOA  steering  vectors  k"  in  the  set 


2-R  + 


Z2B 


which  comprises  integer  combinations  of  the  two  vectors  given  by  the  three- vector  rows  of  B+. 
Those  rows  necessarily  lie  in  the  array  plane,  because  definition  (|26|)  sets  each  of  those  rows  to 
a  linear  combination  of  the  basis  vectors  in  B.  Further,  the  rows  of  B+  are  linearly  independent, 
because  given  any  row  vector  c  of  coefficients  to  combine  them,  0  =  cB+  implies  0  =  cB+B  =  c. 
The  rows  of  2  x  3  matrix  B+  therefore  form  an  alternative  basis  for  the  array  plane,  and  Z2B  "  is  a 
lattice,  here  termed  the  boresight  replication  lattice.  (The  name  will  gain  meaning  in  Section[3]) 
Mathematically,  Z2B+is  the  dual  of  lattice  B^2(and  B^2is  the  dual  of  Z2B+),  so  here  let  us 
term  B+the  dual  basis  matrix.  Further,  in  ([27])  the  two  integers  of  row  beam  index  vector  k"  E  Z2 
govern  the  choice  of  normalized  array-plane  DOA  steering  vector  k"  as  an  integer  combination 
of  the  two  basis  vectors  given  by  the  three-vector  rows  of  — N”1B+.  Matrix  N~  1  is  of  course 
invertible,  so  the  rows  of  — N_1B+  arc  linearly  independent  and  also  form  a  basis.  The  lattice 
Z2N”1B+  of  row  vectors  in  the  array  plane  that  they  specify  is  the  steering  lattice.  The  particular 
form  we  have  given  it  here  will  lead  to  a  DFT  next  and  then  to  an  FFT  in  Section  [4] 

DFT  Beamsteering.  Specializing  beam  sum  (|25[)  to  use  the  particular  normalized  array-plane 
DOA  steering  vector  k"of  steering  specification  ([27])  yields 

Xk„  kX{-k" N~1B+) 
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(29) 


—  22  e-*  gn "  sn" 

n" 

E0-j27rfc"N^1ra''  „ 

6  9n"  Sn"  ■ 


Let  us  use  decomposition  <|2])  for  k" as  discussed  above,  with  remainder  set  [Z2/Z2N]  arbitrary  (un¬ 
til  the  design  examples  of  Section[3j),  and  let  us  also  use  decomposition  ([!])  for  n"  with  [^2/N^2]  = 
{columns}’  a  choice  that  is  not  necessary  but  that  is  convenient.  Using  (|28|)  and  repeating  the  idea 
for  n", 

pj27rk"Bn" j27rfcN~1(Nn/+n)  g— j27rfcn/^— j27rfcN~ 1n  g— j27rfcN~1n 


using  which  beam  sum  ( 29 1  becomes 


X,n  = 


E 


E 

r  mod-N  | 
IcolumnsJ 


— j2-7rfcN  1n 


^Nn'+n  ^Nn'+n 


^  ^  j27rA;N  1n 

ne{  mod-N  | 
IcolumnsJ 


}  1  t/N n'+n  %n'+n 
n' 


(30) 


While  the  left  side  of  (|30|)  is  indexed  on  k"  =  k' N  +  k,  it  is  not  actually  dependent  on  k!  in  any 
way,  so  Xk"  is  not  changed  by  adding  an  offset  of  form  fc^N  to  k" ,  where  k±  E  Z2  is  arbitrary. 
This  simply  reflects  that  the  array  factor  is  periodic  in  the  steering  vector.  We  can  therefore  use 
these  two  decompositions  for  A:" and  an  additional  relationship  from  (fT6|) 


k"=  fe'N  +  k 
k"=  A/modN  +  k"  mod  N 
A;"  mod  N  =  k  mod  N, 


to  assert  that 

E ^ a  —  Xkn  modN  —  -E  k  mod  N  —  -E k  . 

used  in  1  used  only  to  T  used  jn 

system  design  equate  the  others  FFT  design 

We’11  use  the  left  form  for  array  system  design  in  Section  [3]  the  right  form  for  FFT  design  in 
Section  [4]  and  the  middle  two  as  a  convenient  way  to  match  up  those  two  beam-indexing  systems. 
In  other  words,  the  beam  output  we  need  and  the  beam  output  we  are  computing  to  meet  that  need 
must  have  the  same  index  modulo  N. 

With  this  understood  then,  we  can  write  the  general  result  as 


-Efc modN  =  Y,  Xn  e_j27rfcN_ln, 

(31) 

ng|  mod-N  | 

IcolumnsJ 

%n  =  ^^Nn'+n^Nn'+ri) 

(32) 

revealing  the  beam  output  as  a  function  of  beam  index  k  mod  N  to  be  a  generalized  DFT  of  |  N 


n  E 


{columns}}’  where  the  folding  sum  has  combined 


weighted  and  folded  element  outputs  |x 
those  weighted  element  outputs  that  have  the  same  indices  mod  N.  It  is  this  folding  in  ([32])  that 


frees  us  from  any  requirement  that  the  size  of  the  array  match  the  size  of  the  transform. 
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unnormalized 

normalized 

, - ~ - V 

array  plane 

beam  index 

unit  visible  whole 
vector  region  plane 

gross 

DFT 

wave 

DOA 

K 

K  K  K 

steering 

DOA 

Ks 

k"  k"  k" 

k" 

k 

Table  1:  The  frightening  menagerie  of  notations  relating  to  DO  A  (including  some  yet  to  be  intro¬ 
duced  in  the  text). 


Periodicity  of  the  weighted  and  folded  element  outputs.  If  we  were  to  remove  the  restriction  of 
to  {columns}  in  (32),  the  weighted  and  folded  element  outputs  xn  would  in  fact  be  well  defined 


n 


for  all  n  e  f?.  No  information  is  lost  in  restricting  n  to  { 
demonstrate  periodicity  of  xn  in  n  makes  other  values  redundant. 


columns}  however,  because  the  easy-to- 


3  System-design  examples 

This  section  develops  the  basics  of  a  design  discipline  for  a  DFT-steered  array  by  working  through 
a  series  of  examples,  beginning  with  classic  ones. 

The  many  DOA  notations  used  in  the  discussion  to  follow  are  somewhat  bewildering,  so  they 
are  summarized  for  convenience  in  Table  Q] 

A  basis  for  direction  cosines.  Heretofore  the  array  plane  has  been  the  span  of  the  basis-vector 
columns  of  B,  but  these  in  general  are  not  orthogonal.  Of  course  the  rows  of  dual  basis  matrix 
B+  =  (BtB)"1Bt  are  linearly  independent  combinations  of  the  basis-vector  columns  of  basis 
matrix  B,  so  the  rows  of  B+also  span  the  a  nay  plane  and  could  be  taken  as  a  basis  for  it,  but  those 
rows  may  not  be  orthogonal  either.  As  it  is  convenient  to  have  an  alternative,  orthogonal  array- 
plane  basis  for  plotting,  sketching,  and  discussion,  let  the  first  and  second  columns  respectively  of 
3x2  direction-cosine  basis  matrix  P  be  orthogonal  array -plane  unit  vectors  pointing  in  the  azimuth 
and  elevation  directions  normal  to  the  boresight  direction.  If  we  assume  the  array  is  in  a  vertical 
plane  and  that  we  are  at  the  array  and  looking  outward  and  generally  towards  the  horizon  along 
the  boresight  direction,  these  azimuth  and  elevation  directions  are  to  the  right  and  upward.  We 
will  generally  plot  the  location  of  an  arbitrary  three-vector  array-plane  row  vector  Karb  using  its 
(azimuth,  elevation)  pair  of  generalized  direction  cosines  k^P.  These  are  the  traditional  direction 
cosines  of  course  when  the  sum  of  their  squares  is  upper  bounded  by  unity,  as  it  is  for  kP  for 
example. 

3.1  The  Classic  A/2  square-grid  design 

Array-Factor  Plots.  The  Fig.  [2]  example  plot  of  the  magnitude  of  an  array  factor  illustrates  plot 
conventions  and  key  array-factor  features. 

Plots  of  array  factors  in  this  paper  do  not  “go  with’’  particular  DFT-beamsteering 
designs  in  any  aspect  other  than  large-scale  array-factor  periodicity.  Other  array- 
factor  features,  including  beamwidth,  can  be  designed  independently  of  steering  con¬ 
siderations,  The  array  factors  shown  here  are  therefore  only  for  general  illustration. 
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Figure  2:  Example  circular-beam  array  factor  for  a  A/2-spaced  square-grid  array,  plotted  as  color- 
coded  magnitude  versus  the  generalized  direction  cosines  kP of  array-plane  vector  k  that  becomes 
normalized  array-plane  DOA  wave  vector  k  inside  the  visible  region  represented  here  by  a  unit- 
radius  world  map.  Lower  right:  example  of  nonzero  steering.  Other:  unsteered,  at  three  zoom 
levels. 


( They  were  optimized  using  second-order  cone  programming,  taking  advantage  of  the 
symmetry-based  simplifications  discussed  in  the  Allerton  paper  m i  and  using  the  soft¬ 
ware  discussed  in  /[7|/. ) 

In  Fig.  [2] the  decibel  array-factor  magnitude  201og10  | G((k  —  k")B)|  is  represented  by  color 
and  is  plotted  versus  the  generalized  direction  cosines  kP  of  normalized  array-plane  vector  k  with 
the  origin  at  the  center  of  the  unit-radius  “world  map!’ 

The  circular  ||re||  <  1  interior  of  that  world  represents  what  is  traditionally  termed  the  visible 
region  of  the  array  plane.  Inside  the  visible  region  k,  can  be  taken  as  the  projection  k  of  a  normal¬ 
ization  k  =  Ak  of  physical  DOA  wave  vector  k,  and  it  is  the  array  factor  at  these  k  values  that 
governs  the  array’s  response  to  signals.  Inside  the  visible  region  the  Cartesian  plot  position  gives 
the  traditional  direction  cosines  kP  corresponding  to  the  unit  vector  k  with  actual  azimuth  and 
elevation  angles  given  respectively  by  what  appeal's  on  the  world  map  to  be  east  longitude,  relative 
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Figure  3:  This  is  the  classic  Nyquist  design  on  a  square  grid.  The  rows  of  dual  basis  matrix  B+, 
plotted  here  as  long  vectors  using  generalized  direction  cosines  B  1  P,  determine  array-factor  peri¬ 
odicity.  The  columns  of  basis  matrix  B  are  plotted  here  as  short  vectors  using  generalized  direction 
cosines  P7  B  and  are  scaled  by  A  to  determine  unnormalized  element-location  lattice  AB.'|i2.  Here 
the  columns  of  B  are  orthogonal  with  length  1/2  and  so  indicate  A/2  element  spacing  on  a  square 
grid. 
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Figure  4:  The  choices  for  steering 
vector  k"  the  steering  lattice,  have 
generalized  direction  cosines  k"P 
marked  here  by  grid-line  intersections 
|  .  That  lattice  must  contain  the 
basis- vector  rows  of  dual  basis  matrix 
B+  plotted  here  as  generalized  direc¬ 
tion  cosines  B+P.  Coordinates  k"  = 
[  |  ,  — ]  of  each  such  vector  in  terms 
of  the  |  and  —  grid  lines  determine 
steering  density  matrix  N.  An  array- 
factor  period's  worth  of  the  steering 
lattice,  those  with  beam-index  vectors 
k" in  [Z2/Z2N] ,  are  marked  with  dots 
(solid  or  hollow).  FFT  outputs  need 
not  be  implemented  where  those  steer¬ 
ing  vectors  fall  outside  (hollow  dots) 
the  steering  region. 


to  the  vertical  meridian,  and  north  latitude. 

Planar  steering  vector  k"is  a  parameter  and  is  zero  in  Fig.  [2]  except  for  the  plot  on  the  lower 
right,  which  illustrates  that  steering  amounts  to  simple  translation  in  n.  Plotting  versus  azimuth  and 
elevation  angles  instead  would  have  destroyed  this  elegant  translation  property. 

Steered  array  factor  G((k  —  kr,)B)  governs  signal  response  for  visible  k  but  is  certainly  well- 
defined  for  arbitrary  Further,  this  array  factor  is  periodic  in  k,  because  FIR  response  G(f)  in  (|24|) 
is  periodic  in  /,  the  offsetting  of  which  by  any  row  two-vector  of  integers  leaves  G(f)  unchanged. 
In  array  factor  G((k  —  k7/)B)  offsetting  k,  by  any  element  of  Z’2B  "  offsets  f  =  (k  —  k")B  by 
an  integer  two  vector,  because  B+B  =  I.  The  unsteered  array  factor  G'(kB),  for  example,  is  for 
any  k  E  Z2B+  the  same  as  for  k,  =  0.  the  center  of  the  boresight  beam.  The  row-vector  elements 
of  boresight  replication  lattice  Z2B  1  are  the  centers  of  the  periodically  replicated  beams  in  the 
lower-left  plot  in  Fig.  [2] 

The  array  factor  for  invisible  k  is  irrelevant  to  signal  response,  but  its  values  at  such  k,  are 
very  relevant  to  noise  analysis,  as  when  identical  individual  preamplifiers  are  used  at  each  element 
to  set  the  system  noise  figure,  the  front-end  noise  they  produce  results  in  array  rms  output  noise 
proportional  to  the  rms  value  of  the  array  factor  over  an  entire  period — for  example  the  single¬ 
period  area  covered  by  the  upper-left  plot  in  Fig.  [2] — and  not  just  over  the  visible  region.  The 
invisible  response  therefore  directly  impacts  SNR  gain  or,  equivalently,  aperture  efficiency  or  taper 
loss,  and  for  that  reason  the  array-factor  optimization  should  generally  be  configured  to  extend  the 
sidelobe  region  outside  the  visible  region  and  across  the  invisible  region. 

Grating-lobe  prevention.  Suppose  the  bright-blue  part  of  the  visible  region  in  the  Fig.[3]diagram 
is  a  steering  region  comprising  all  values  of  steering  vector  k"  all  possible  beam-center  values,  to 
be  used  in  a  particular  system  design.  Of  course  with  DFT  beamsteering  it  would  be  a  discrete  set, 
but  here  it  is  approximated  as  a  simply  connected  region  for  convenience.  When  steering  places 
the  beam  center  on  this  steering  region’s  edge  the  beam’s  transition  region,  the  rolloff  of  the  array 
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factor  to  the  sidelobe  level,  will  extend  outside  the  steering  region.  The  additional  area  covered 
by  that  transition  region  as  steering  vector  k"  traverses  the  steering  region  is  sketched  in  pale  blue 
and  here  will  be  termed  the  steering  penumbra.  The  steering  region  and  penumbra  are  duplicated 
periodically  in  k,  because  the  entire  array  factor  is.  If  any  of  the  invisible-region  periodic  replicas 
of  the  steering  region  with  penumbra  intrudes  into  the  visible  region  at  any  point,  then  there  is 
some  allowable  normalized  array-plane  steering  vector  k"  that  will  result  in  some  part  of  the  replica 
beam  becoming  visible,  in  which  case  that  replica  beam  is  termed  a  grating  lobe  (standard  term).  A 
replica  beam  that  does  not  so  intrude  might  be  called  a  potential  grating  lobe  (nonstandard  term). 

Many  components  of  array  cost  are  directly  proportional  to  the  density  of  elements  in  the  array 
plane.  The  normalized  element-location  lattice  BZ2  has  density 


1 


(33) 


per  unit  square,  so  the  density  of  unnormalized  element-location  lattice  AB|F  is  l/A2of  this  value. 
Generally  a  system  designer  wishes  to  minimize  this  density,  or  equivalently,  to  maximize  the  den¬ 
sity  of  replicated  steering  regions  in  the  plane.  The  usual  approach  involves  four  steps. 


1 .  System  requirements  for  steering  the  beam  determine  the  size  and  shape  of  the  steering  region 
within  the  “world”  coordinate  system. 

2.  The  beam's  rolloff  characteristics — these  can  be  estimated  prior  to  actual  beam  design — 
determine  the  boundaries  of  the  larger  steering  penumbra. 

3.  One  chooses  the  two  rows  of  B+,  vectors  in  the  array  plane,  so  that  replicas  translated  by 
the  elements  of  Z2B+  will  be  packed  as  densely  as  possible  in  the  plane  without  replica 
penumbras  intruding  into  the  visible  region.  Sometimes  a  priori  constraints  on  grid  geometry 
must  be  considered  here. 

4.  Finally,  the  basis  vectors  for  element  positions  Bzfr  are  determined  by  solving  definition 
B+  =  (B/B)_1Br  for  B  in  terms  of  B+.  To  do  this,  and  using  notation  B+/  for  (B+)r, 
first  invert  B+B+T  =  ((BTB)-1BT)  (B(BrB)-1)  =  (BTB)“1  to  obtain  (B+B+T)-1  = 
B7B.  Then  multiply  the  latter  on  the  right  by  definition  B+  =  (B7  B)  1 B7  to  obtain 
(B+B+t)~1B+  =  B7B  (B7B)_1B7  =  B7.  Transposing  gives  the  desired  result,  B  = 
B+t(B+B+t)  \  which  is  just  B  =  B+T+T.  (This  remarkably  awkward  notation  becomes 
just  B  =  B++  once  we  agree,  as  is  matlab’s  pinv  ( )  convention,  that  for  wide  matrices, 
taking  the  pseudoinverse  begins  and  ends  with  transposition  and  takes  the  pseudoinverse  of  a 
tall  matrix  in  between.) 

The  classic  example  is  that  of  Fig.  [3]  where  the  steering  region  and  penumbra  just  fill  the  visible 
region  and  where  an  a  priori  design  constraint  in  favor  of  a  square  element-layout  grid  dictates  that 
the  rows  of  B+be  chosen  to  be  orthogonal  and  of  equal  vector  length  so  that  the  columns  of  B 
that  govern  element  placement  will  be  orthogonal  and  of  equal  length  as  well.  Then  the  choice  of 
B+  rows  that  will  make  boresight  replication  lattice  Z2B+  as  dense  as  possible  are  those  shown 
in  Fig.  [3]  Of  course  those  row  vectors  could  be  reversed,  individually  negated,  or  rotated  together 
without  disturbing  the  density  relationships.  The  particular  choice  made  here,  however,  will  lead 
shortly  to  convenient  beam  indexing. 
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Nyquist  element  spacing.  In  this  example  replicating  the  world  map  periodically  would  produce 
a  universe  with  an  infinite  number  of  worlds  optimally  packed  in  the  plane,  touching  but  not  over¬ 
lapping  and  as  dense  in  the  plane  as  possible  given  the  requirement  that  boresight  replication  lattice 
Z2B+  and  element-location  lattice  Bf2  be  square  grids.  When  such  optimal  packing  of  visible- 
region  worlds  obtains,  the  elements  are  said  to  be  Nyquist  spaced. 

We  can  use  our  direction-cosine  basis  matrix  P  to  go  from  the  graphical  image  in  Fig.  [3]  to  a 
mathematical  formulation  and  write 


B+  = 


-2 

0 


and 


Here  the  columns  of  B  that  govern  the  placement  of  elements  on  points  of  AB|i2  correspond  to  the 
classic  Nyquist  A/2  nearest- neighbor  spacing  on  a  square  grid. 

Nyquist  spacing  is  a  useful  reference  point,  but  it  is  optimal  only  in  the  special  case  in  which 
the  steering  region  and  penumbra  together  exactly  fill  the  visible -region  world,  as  in  this  example. 


The  classic  steering  lattice  for  the  classic  array.  Above  we  removed  the  confinement  of  spatial 
frequency  k  to  the  visible  region  and  examined  the  behavior  of  array  factor  G'((k,  —  k")B)  as  a 
function  of  the  unrestricted  spatial  frequency  k.  Though  we  may  not  find  it  useful  to  steer  outside 
the  visible  region,  the  mathematics  and  any  associated  realization  will  certainly  not  object  to  us 
doing  so,  so  we  can  likewise  remove  the  confinement  of  steering  vector  k"  to  the  visible  region  and 
consider  steering  beam  sum  (|3T|)  with  an  unrestricted  steering  vector  k"  obtained  by  removing  the 
visible-region  restriction  from  (|27j)  to  obtain 

k"  =  -fc"N_1B+.  (34) 


Then  of  course  (34 1  steers  array  factor  G((k  —  k")B)  as  well. 

A  valuable  alternative  interpretation  of  <(34])  is  that  steering  lattice  Z2N_1B+is  a  superlattice  of 
boresight  replication  lattice  Z2B+.  This  is  so  because  an  arbitrary  element  fcar|-,B  1  of  Z2B  2  is  equal 
to  (fcarbN)N-1B+and  is  thus  a  member  of  Z2N_1B+  also.  To  choose  steering  density  matrix  N, 
one  must  therefore 


1. 

2. 

3. 


choose  a  steering  lattice  that  is  a  superlattice  of  the  boresight  replication  lattice, 
choose  two  points  of  steering  lattice  Z2N“ 1 B  1  that  can  function  as  a  basis  for  it, 


taking  those  steering-lattice  basis  vectors  to  be  the  rows  of  steering-lattice  basis  matrix  N-iB+ 
determine  the  beam  index  ^'required  to  steer  k"  to  each  basis  vector  row  of  B  N  n  turn  using 
d34l),  and 


4.  solve  for  N. 


Let  us  examine  the  last  step  more  closely.  Suppose  we  have,  as  per  step  [3]  chosen  the  beam 
indices  fc"that  will  steer  the  now  unrestricted  beam  to  each  of  [l  0]  B+  and  [0  l]  B+,  the  row  basis 
vectors  of  the  dual  basis  matrix  B+  for  the  boresight  replication  lattice  Z2B+  that  governs  array- 
factor  periodicity.  By  (|34|)  we  then  have 


[1  0]B+  =  —  fe/1/N_1B+ 
[0  l]B+  =  -k^N~1B+, 


which  of  course  is  just  the  single  matrix  equation 


B+ 


k'[ 

K 


N_1B+. 
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Multiplying  on  the  right  by  BN  and  using  B+B  =  I  results  in 


N  =  - 


k'[ 

k", 


(35) 


so  the  required  choices  of  k'(  and  fc"  are  just  the  rows  of  — N.  This  relationship  is  more  useful  in 
form  ([35]),  however,  because  there  it  informs  our  choice  of  steering  density  matrix  N. 

Now  consider  the  Fig.  [3]  design  in  particular,  and  suppose  we  create  a  steering  lattice  with 
12  2  =  144  beams  per  array-factor  period  by  assigning  the  first  and  second  rows  of  dual  basis  matrix 
B+  the  beam-index  vectors  [  —  12  0  ]  and  [0  — 12  ]  respectively,  so  that 
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O' 

J  0  -12], 

0 

12 

Here  the  elements  of  N  are  positive,  but  this  is  not  required.  We  could  replace  this  N  matrix  with 
its  negative  here  with  no  harm  done,  as  the  sign  change  would  in  the  end  have  been  compensated  by 
the  associated  sign  reversals  in  the  indices  k"used  to  select  beams  in  particular  directions. 

We  determined  N  from  the  steering  lattice  ^2N”1B+of  possible  values  for  unrestricted  steering 
vector  k"  but  of  course  either,  once  known,  determines  the  other.  The  schema  for  the  current 
example  is  shown  in  Fig.  [4]  where  k"is  selected  using  ([34])  and  the  index  k"  =  [  I  .  ]  determined 

by  the  |  and  grid  lines  associated  with  a  |  grid-line  intersection.  The  |N|  =  122  =  144 
possible  k"  values  in  a  particular  array-factor  period  are  shown  as  dots,  with  the  steering-region  k" 
subset  shown  solid  and  the  rest  shown  hollow.  The  beam-index  vectors  k"  associated  with  all  of 
the  dots  together  are  the  elements  of  a  particular  choice  for  remainder  set  [Z2/Z2N]  that  would  be 
natural  for  the  system  designer. 


A  bricklayer’s  grid  for  the  classic  array.  As  an  alternative,  we  can  give  the  same  classic  Fig.  [3] 
array  design  a  steering  lattice  in  which  unrestricted  steering  vectors  k"are  offset  in  alternate  rows,  in 
brick-wall  fashion.  We  could  have  spaced  steering  vectors  more  closely  in  elevation  than  in  azimuth 
in  the  previous  example  and  did  not,  but  here  we  can  and  will,  simply  to  illustrate  the  approach. 

We  begin  with  a  low-density  pie  I  ini  in  ary  steering  lattice  by  assigning  the  first  and  second  rows 
of  dual  basis  matrix  B+the  beam-index  vectors  [l  —  1  ]  and  [—2  —  2  ]  respectively,  so  that 


N 


prelim  — 


'[  1  -1]‘ 

'-1  1' 

_[-2  — 2]_ 

2  2 

This  will  place  the  unrestricted  steering  vectors  k"on  the  intersections  in  Fig.  [5] of  the  and 
lines  and  results  in  the  least-dense  steering  lattice  ^2Npn[limB+  that  is  a  superlattice  of  boresight 
replication  lattice  Z2B+  while  having  a  brick  layout  and  2  :  1  horizontakvertical  spacing.  Here 
Z2Npi.([iiniB+has  | Npreiim |  =  4  times  the  density  of  Z2B+. 

To  scale  up  the  density  in  what  will  be  an  FFT-friendly  way  in  Section  [4j  we  then  choose  our 
final  steering  lattice  Z2N_1B+to  be  a  superlattice  of  preliminary  steering  lattice  Z2N|ir[|iniB~  by 
scaling  preliminary  density  matrix  Npreiim  on  the  right  with  a  nonsingular  2x2  integer  scaling  matrix 
N scale ■  It  follows  that  N  =  NpreiimNscaie  and  therefore  that  N”1  =  N“^leN”glim,  which  gives  us 
the  superlattice  chain 


;2N-‘B+  =  Z2N-i|eNp-;]imB+  3  Z2N^llmB+  3  Z2B~ 
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Figure  5:  Here  the  same  classic  A/2- 
spaced  square  element  grid  was  used 
as  in  Fig.  [4]  along  with  the  same  steer¬ 
ing  region,  but  steering  density  matrix 
N  was  chosen  to  place  steering  vec¬ 
tors  using  the  offset  rows  of  a  brick¬ 
layer’s  grid  and  with  elevation  spacing 
half  of  vertical  spacing.  The  resulting 
steering- vector  lattice  is  a  slightly  flat¬ 
tened  version,  by  \/3/2  vertical  scal¬ 
ing  actually,  of  a  classic  equilateral- 
triangle  lattice. 


Here  we  use  the  simplest  possible  choice  of  Nscaie,  an  identity  matrix  times  some  highly  composite 
integer,  say  eight: 
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Here  matrix  81  was  used  instead  of  a  scalar  eight  in  order  to  make  explicit  that  N  is  in  a  form 
compatible  with  implementation  of  this  system  with  the  FFT  developed  in  Section[4]  This  scaling  by 
eight  multiplies  each  grid  label  in  Fig.[5]by  eight  as  well.  One  period’s  worth  of  the  resulting  steering 
vectors  are  shown,  with  the  period  chosen  for  convenient  figure  sizing  and  obtained  by  starting  with 
a  large,  horizontally  extended,  diamond-shaped  period  and  cutting  off  and  translating  pieces  by 
integer- vector  left  multiples  of  matrix  N  as  convenient.  There  are  |N|  =  4  x  82  =  256  steering 
vectors  in  the  period,  of  which  the  169  associated  with  the  steering  region  must  be  implemented. 


3.2  The  Classic  A/\/3  triangular- grid  design 

Fig.  [6]  illustrates  the  basis  geometry  of  the  classic  Nyquist  design  on  an  equilateral-triangular  grid 
(generally  termed  a  hexagonal  lattice  in  mathematics  |[6j).  This  geometry  gives  the  densest  possible 
boresight  replication  lattice  -Z2B  1  and  so  results  in  the  lowest  element  density  in  the  plane  of  any 
2D  Nyquist-spaced  lattice  array.  By  (|33|)  that  density  is 
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or  \/3/2  ~  87%  of  the  4/ A2  density  of  the  A/2-spaced  square-grid  array  of  the  last  section.  Here 
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Figure  6:  This  is  the  classic  Nyquist  design  on  an  equilateral-triangular  grid.  The  rows  of  dual 
basis  matrix  B+,  plotted  here  as  long  vectors  using  generalized  direction  cosines  B+P,  determine 
array- factor  periodicity.  Here  they  are  120°  apart  and  of  length  2.  The  columns  of  basis  matrix 
B  are  plotted  here  as  short  vectors  using  generalized  direction  cosines  P7  B  and  are  scaled  by  A 
to  determine  unnormalized  element-location  lattice  AB^2.  Here  those  columns  are  60°  apart  with 
length  1  / \/3  and  so  indicate  A/\/3  element  spacing  on  an  equilateral-triangular  grid. 
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Figure  7:  Example  circular-beam  array  factor,  for  a  A/\/3-spaced  triangular-grid  array,  plotted  as 
magnitude  versus  the  generalized  direction  cosines  kP of  array-plane  vector  k  that  becomes  array- 
plane  DOA  wave  vector  k  inside  the  world-map  visible  region.  Lower  right:  example  of  nonzero 
steering.  Other:  unsteered,  at  three  zoom  levels. 


Aside:  For  numerical-precision  reasons  it  may  sometimes  be  helpful  to  have  radicals  only  in 
direction-cosine  basis  matrix  P  and  in  scalar  scale  factors.  This  can  be  done  by  choosing 
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which  puts  boresight  in  the  [1  1  1]  direction  with 
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An  example  array  factor  for  this  array  geometry  is  shown  in  Fig.  [7]  The  hexagonal-lattice 
character  of  the  boresight  replication  lattice  is  immediately  visible. 
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Figure  8:  This  example 
depicts  the  classic  ap¬ 
proach  to  DFT  beam¬ 
steering  for  the  Fig.  [6] 
Nyquist  design  with 
A/\/3  element  spac¬ 
ing  on  an  equilateral- 
triangle  grid.  In  this 
example  each  DFT  pe¬ 
riod  has  272  =  729 
beam-index  vectors  k", 
with  each  equal  to  27  x 
Only  547  of 
the  associated  steering 
vectors  are  inside  the 
steering  region. 


The  classic  triangular  steering  lattice.  The  two-step  design  procedure  at  the  end  of  Section [3TT] 
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results  in  the  steering  lattice  illustrated  in  Fig.  [8]  The  number  27  used  here  was  arbitrary,  of  course, 
and  in  Section  [4]  we  will  certainly  see  that  a  power  of  three  like  this  is  compatible  with  FFT  re¬ 
alization.  Choosing  N  to  be  a  scalar  multiple  of  identity  matrix  I,  as  was  done  here,  results  in  a 
hexagonal  steering  lattice,  because  boresight  replication  lattice  Z2B+is  hexagonal. 

3.3  The  general  case:  custom  replication  and  steering  lattices 

In  each  example  of  Sections  |3.1|  and  |3.2[  a  circularly  shaped  steering  region  led  to  a  maximum 
amount  of  symmetry  in  boresight  replication  lattice  Z~B  f  and  hence  in  element-location  lattice 
BE 2  But  what  if  the  steering  region  is  more  peculiarly  shaped? 

In  Fig.  [9]  the  array  is  to  be  steered  from  —45°  to  45°  in  azimuth  and  —5°  to  30°  in  elevation. 
Further,  let  us  assume  that  grating  lobes  below  about  —40°  in  elevation  are  harmless  to  system 
operation.  This  could  be  because  the  embedded  pattern  of  the  particular  elements  used  strongly 
suppresses  signals  from  that  part  of  the  visible  region,  or  it  might  be  due  to  some  large  and  largely 
nonscattering  obstruction  in  the  array’s  field  of  view  in  those  directions.  (Is  there  such  a  thing?) 
Maybe  it’s  for  a  combination  of  reasons.  It’s  a  system-design  issue  and  doesn’t  matter  here.  Here 
let  us  simply  suppose  that  it  is  so  and  see  how  we  can  take  advantage  of  it  to  reduce  the  cost  of  the 
array.  And  let  us  also  suppose  that  grating  lobes  are  harmless  at  extreme  vertical  angles. 

In  Fig.  [9]the  replicated  steering  regions  with  penumbra  are  packed  as  tightly  as  possible  consis¬ 
tent  with  grating-lobe  requirements  and  consistent  with  giving  replication  offsets  a  lattice  structure. 
The  latter  is  important.  For  example,  while  lattices  without  vertical-midline  reflection  symmetry 
are  certainly  possible,  here  the  vertical-midline  reflection  symmetry  of  the  steering  region  prevents 
such  a  possibility  from  being  useful.  This  is  because  the  translation  amounts  come  from  boresight 
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Figure  9:  If  grating  lobes  in  some  directions  are  acceptable,  the  less-dense  element-location  lat¬ 
tice  B^2  resulting  from  a  more-dense  boresight  replication  lattice  Z2B  '  may  lower  costs  without 
necessarily  altering  the  range  of  steering-grid  design  alternatives. 


replication  lattice  Z2B+,  and  every  lattice  is  closed  under  negation.  Lowering  the  left  column  of 
pictured  replicas,  for  example,  to  bring  the  “left  shoulder”  replica’s  penumbra  to  the  edge  of  the 
visible  region  would  therefore  require  raising  the  “right  hip”  replica  at  the  same  time,  immediately 
introducing  grating  lobes. 

This  boresight  replication  lattice  Z2B~  has  the  pictured  row  basis  vectors  and  was  determined 
graphically  after  sliding  steering  regions  with  penumbra  around  in  the  software  package  used  to 
make  the  drawing: 
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Basis  B  =  B+t+t  was  then  computed  to  be 


B  =  P 


4 

207 


15 

46 


30 

0 


(36) 


Using  the  same  procedure  as  in  the  previous  examples,  each  box  I  I  in  Fig.  [9]  can  be  given  nroWs 
rows  by  ncois  columns  of  beams  using  steering  density  matrix 
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We  can  see  from  the  column  basis  vectors  of  B  pictured  in  Fig.  [9]that  the  normalized  element- 
location  lattice  B|i2is  more  or  less  a  vertically  stretched  version  of  the  corresponding  lattice  in  the 
equilateral-triangle-grid  design  of  Section  3.2  The  savings  in  element  density  here  is  substantial, 
as  the  density  (|33|)  of  the  unnormalized  element-location  lattice  ABZ2  here  is  ffjj/A2  ~  1.94/A2, 
so  there  are  just  under  two  elements  per  square  box  with  sides  of  length  A.  Relative  to  the  usual  A/2 
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Figure  10:  Example  circular-beam  array  factor  for  a 
custom-grid  array.  Left:  the  element  layout  used,  the 
perimeter  of  which  is  very  slightly  elliptical. 


spacing  on  a  square  grid,  the  reduction  is  nearly  52%,  and  relative  to  A /  vx3  spacing  on  a  equilateral- 
triangular  lattice  the  reduction  is  about  44%.  We  therefore  expect  reductions  of  those  orders  in  the 
number  of  elements  required  to  achieve  a  given  beamshape  and  sidelobe  level. 

By  way  of  example,  a  circular-beam  array-factor  design  comparable  to  those  of  Figs.  [2]  and  [7] 
but  using  the  element  layout  specified  by  p6|)  is  shown  in  Fig.  10  Though  it  was  designed  without 
looking  back  at  the  numbers  of  elements  used  in  the  square-grid  and  triangular-grid  designs  of 
Figs.  0  an  d0  it  turned  out  to  require  437  elements,  areduction  of  about  51.5%  and  45%  respectively 
relative  to  those  two  earlier  designs. 

Each  of  the  element-layout  grids  discussed,  the  square  grid,  the  triangular  grid,  and  this  custom 
grid,  fundamentally  determines  grating-lobe  behavior  but  without  limiting  possible  beam  shapes. 
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Figure  11:  Above:  example  oval-beam  array  fac¬ 
tor  for  a  custom-grid  array.  Left:  the  element  lay¬ 
out  used. 


This  is  illustrated  by  the  Fig.  [IT] design,  which  uses  the  same  custom  grid  as  the  design  of  Fig.  10 


but  aims  instead  for  an  oval  beamshape.  A  significantly  greater  number  of  elements  was  used  in 
order  to  obtain  the  narrower  elevation  beamwidth. 
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4  A  2D  FFT  with  a  matrix  size  parameter 


The  conventional  FFT  algorithm  is  generalized  below,  in  Section  |4.1|  to  allow  the  steering  den¬ 
sity  matrix  N  to  take  up  the  position  in  the  algorithm  usually  played  by  the  number  N  of  FFT 
points.  Because  of  that  connection  N  will  here  be  termed  the  FFT  size  matrix.  Particular  forms 
of  FFT  size  matrix  and  particular  factorizations  thereof  then  give  the  familiar  special  cases  dis¬ 


cussed  in  Section  4.2  which  include  the  conventional  radix-2  and  mixed-radix  FFT’s,  each  in  both 


decimation-in-time  and  decimation-in-frequency  forms,  and  the  2D  FFT  in  its  most  common  form. 


Finally,  the  real  utility  of  the  general  formulation  is  illustrated  in  Section  4.3  which  develops  an 
example  custom  FFT  design  targeted  at  the  array  steering  system  of  the  last,  custom-grid  example 


design  of  Section  3.3 


4.1  General  derivation 


The  remaindered  division  of  an  integer  vector  by  a  square  integer  matrix  discussed  in  Section  2. 1 
is  at  the  heart  of  the  derivation  below,  which  begins  with  the  factorization  of  FFT  size  matrix  N 
and  the  subsequent  decomposition  of  index  column  vector  n  into  a  multiple  of  one  of  the  resulting 
factor  matrices  and  a  vector  remainder.  This  amounts  to  a  single  vector  digit’s  worth  of  radix 
decomposition  of  n  using  a  matrix  radix  and  allows  the  DFT  expression  to  be  factored.  A  similar 
radix  decomposition  of  index  row  vector  k  but  using  the  other  factor  of  N  then  allows  the  factored 
DFT  to  be  re-expressed  using  two  smaller  DFT  expressions. 

The  discussion  here  assumes  size  parameter  N  is  of  size  2x2,  but  only  for  convenience  of 
presentation  and  notation.  The  approach  is  general. 


Factoring  the  size  parameter.  Suppose  nonsingular  2x2  integer  matrix  N,  the  FFT  size,  is 
composite  in  the  sense  that  N  =  LR,  a  product  of  two  2x2  integer  matrices.  What  then  becomes 
of  [zfii2/Nzp-]  and  decomposition  0?  Can  they  be  expressed  in  terms  of  L  and  R? 

In  the  spirit  of  ([T}  but  using  matrix  L  decompose  element  index  n"  £  |i2  as 

n"  =  L  /r  +  £, 

where  £  £  [^2/L^2] .  Then  further  decompose  //  £  ^2as 

fi  =  Rn;  +  m, 

where  m  £  [^2/R^2] .  Together  these  give 

n"  =  L(Rn/+  m)  +  £  =  Nn'+  Lm  +  £, 
which  will  be  just  decomposition  ([[}  with 


n  =  Lm  +  i 


if  set  {Lm  +  t :  m  £  [^2/R^2  ,  £  £  [^2/L^2] }  qualifies  as  a  remainder  set  [|i2/N|i2] . 

It  does  in  fact  qualify.  First,  if2/L{L2]  and  [^2/R^2]  contain  |L|  and  |R|  elements  respectively, 
so  there  are  |L|  |R|  =  |N|  possible  choices  of  (£,  m)  pairs  in  the  decompositions  above.  These  pairs 
yield  distinct  sums  Lm+f  because  Lmi+fi  =  Lm2  +  £2  implies  £1  —£2  =  L(m2  —  mi),  which 
is  disallowed  by  the  definition  of  [^2/L^2]  unless  both  sides  are  zero,  proving  by  contradiction  that 
set  {Lm  +  £ :  m  £  [^2/R^2] ,  £  £  [^2/L^2] }  contains  |N|  elements. 
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Second,  no  two  sums  L m  +  £  differ  by  a  right  multiple  of  N  because  (Lmi  +  £\)  —  (Lm.2  + 
£2)  =  Ni/  =  LRiz  can  be  rewritten  as  L(mi  —  m2  —  Riz)  =  £2  ~  £ 1 ,  which  is  again  disallowed 
by  the  definition  of  [^2/L^2]  unless  both  sides  are  zero,  so  £2  =  £\  and  m\  —  m2  =  Riz.  But  the 
latter  is  also  disallowed  by  the  definition  of  [^2/R^2]  unless  each  of  its  sides  is  zero,  so  m\  =  m2, 
and  we  have  proved  by  contradiction  that  no  two  sums  Lm  +  £  differ  by  a  right  multiple  of  N. 

While  we  could  work  with  arbitrary  remainder  sets  [^2/L^2]  and  [^2/R^2] ,  let  us  instead 
standardize  them  and  take 

EtW]  =  C™J 
r/Rfl  =  {”£,}■ 

Having  established  that  the  defining  requirements  are  met,  we  can  now  take 

P/N?2]  ={L  «  +  }|.  (37) 

We  will  work  both  with  this  specific  [|E2/N|E2]  and  with  They  are  related  through  ([17]), 

which  here  takes  the  form 

=  «Lm  +  l)  mod  N :  m  6  {  ”£,} ,  l  e  {  }  }  (38) 

and  will  soon  become  the  relationship  we'll  use  to  split  the  FFT. 

For  row  vectors  we  could  standardize  the  remainder  sets  [S2/^2L]  and  [Z2/^2 R]  but  will 
not.  The  FFT  designer  can  use  { }  and  j  or  choose  any  convenient  alternative.  Given 

particular  choices  for  those  remainder  sets,  we  then  use 

[Z2/^2N]  =  {sR  +  r :  a  €  [%2/%2L] ,  r  €  [^2/^2n] },  (39) 

and  the  decomposition  for  k  €  [Z2/Z2  N]  that  it  implies. 

Splitting  the  DFT.  Relationships 

n  =  (Lm  +  £)  mod  N  (40) 

n  =  Lm  +  £  -  N n()ffset  (41) 

are  equivalent,  with  the  latter  following  from  the  former  for  some  n()ffset  <G  %2  by  Below  we 
will  use  (|40j)  in  subscripts  and  ([41])  in  superscripts,  the  latter  to  take  advantage  of 

e-j2)rfcN'1n  _  e-j27rA;N^1(Lm+£-Nn(ffset) 

—  e-j27rA;N^1(Lm+£)ej27rfcn'ffset 
_  — j27rfcN^1(Lm+£) 

which  of  course  makes  the  particular  value  taken  by  n()(Tset  irrelevant. 

Using  (|4()|)  and  (|4T|)  then,  the  DFT  in  ([37])  becomes 

V  _  \  '  \  '  — j27rfcN_1(Lm+£) 

2/femodN  /  y  ^(Lm+tJmodN® 

c  r  mod-R  \  f  r  mod-L  \ 

/columns  /  /columns  / 

E  — j27rfcN~1£  \  '  —  j27rfc(LR)“1Lm 

e  2^  ®(Lm+£)modN  e 

/,,/ mod-L  \  mod-R  \ 

lcolumns/  /columns/ 

E  —  j27rfcN~1£  \  '  — j27rfcR_1m  (A'1\ 

6  /  ^  ^(Lm+tJmodN®  •  (44) 

£  r  mod-L  1  m  r  mod-R  \ 

lcolumns/  lcolumns/ 
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Figure  12:  The  structure  (|43|)  of  a  general  FFT  with  size  matrix  N,  assuming  N  =  LR  and  using 
component  FFT’s  with  size  matrices  L  and  R  and  drawn  here  for  |  det(L)|  =  3  and  |  det(R)|  =  4. 
Twiddle  factors  e_J27rrN  ^or,  equivalently,  e~i27r(>r<  ,(L  ^  are  applied  between  the  FFT  steps. 


The  general  form  of  the  FFT.  Decomposition  k  =  sR  +  r  with  s  e  [Z2/Z2L]  and  r  G 
[Z2/Z2R]  is  implied  by  ([39]).  Using  it  to  rewrite  ([42])  results  in 


E  — j27r(sR+r)N  9  \  ' 

6  /  ,  ^(Lm+t)modN  ® 

£  r  mod-L  I  (  mod-R 

IcolumnsJ  t\columnsi 


— j27r(sR+r)R  1m 


E_-j27rsR(LR)-9  _-j27rrN-9  ™ 

e  e  X(Lm+t)modNe 


— ]2'Ksm  —  j27rrR  1m 


rnocl- Li  J 
IcolumnsJ 


=  E 

r  mod-L  i  L 
^  IcolumnsJ 


rne{m,  ) 

IcolumnsJ 


3-j27r(r-R-1)(L-9) 


E 


^'(Lm+£)  mod  N 


a— j27rrR  1m 


m  £ 


r  mod-R  J 
IcolumnsJ 


;i— j27rsL  1£ 


(43) 


Some  interpretation  of  ([43])  will  reveal  an  FFT  structure.  The  inner  sum  is  a  DFT  of  matrix  size 


R  of  x 


(Lra+£)  mod 


N  taken  as  a  function  of  m,  with  £  treated  as  a  parameter  and  where  the  “mod  N” 


is  unnecessary,  strictly  speaking,  because  of  the  periodicity  of  weighted  and  folded  element  outputs 
xn  demonstrated  at  the  end  of  Section  [23]  This  inner-sum  DFT  is  a  function  of  r  and  £  only  and 
can  be  thought  of  as  |  det(L)  |  output  sets  of  |  det(R)  |  points  each.  After  multiplication  by  so-called 
twiddle  factors  (standard  name)  e“j2"'rR  :  1  L  f  ]  or,  equivalently,  e_j27rrN  £  these  FFT  output 
sets  are  taken  collectively  to  be  a  function  of  £  with  r  as  a  parameter  and  subjected  in  the  outer  sum 
to  DFT’s  of  matrix  size  L,  one  for  each  value  of  parameter  r,  resulting  in  |  det(R)|  output  sets  of 
|  det (L)  |  points  each. 

The  FFT  of  (43 1  is  shown  in  block-diagram  form  in  Fig.[l2]for  the  specific  case  |  det(L)|  =  3 
and  |  det(R)|  =  4,  which  numbers  then  become  the  sizes  of  the  component  matrix-size  FFT’s 
shown  as  boxes.  Here  the  optional  “mod  N”  steps  on  element  (input)  and  beam  (output)  index 
vectors  are  omitted  for  readability,  and 


column  vectors: 
row  vectors: 


[^2/R^2]  =  {mi,  m2,  m3,  m4} 
[Z2/Z2R]  =  In,  r2,  r3,  r4} 


[?2/i42]  ={e1,e2,e3} 

[Z2/Z2L]  ={S1,S2,S3}. 


The  component  FFT's  on  the  left  are  all  identical,  as  are  the  component  FFT's  on  the  right.  These 
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component  FFT’s  can  be  implemented  directly  from  DFT  expression  (pT]  or,  if  their  size  matrices 
arc  factorable,  expressed  as  smaller  FFT’s  using  ([43])  recursively. 


4.2  Familiar  special  cases 

Special  case:  the  ordinary  ID,  radix-2  FFT.  Suppose  N  =  [2A  o]_  Factorization  N  =  LR  = 

[gsnr^with 


^2/R?2]  =  =  {  [  8  ]...:[  2^-1  ]  } 

52/L?2]  =  [^2/S2L]T={[0],[i]} 


gives  the  FFT  of  (43 1  and  Fig.  12  an  input  stage  that  is  a  pair  of  2A  1  point  FFT’s  and  an  output 
stage  comprising  2A  -1  transforms  of  two  inputs  to  two  outputs,  each  of  which,  by  (|3T|),  is  the  classic 


butterfly  (standard  name)  computation,  a  sum  and  a  difference.  The  twiddle  factor  shown  in  Fig.  12 
here  becomes 

e-j27rrN-1£  =  e~}2nr  [  V2*  °]*  =  e_j2^n^/2 K +r2t2  =  Q-]2-Krxixl2K  . 

Recursively  decomposing  the  input  FFT’s  into  smaller  FFT’s  until  the  input  FFT's  are  nothing 
but  butterflies  then  yields  the  classic  radix-2  decimation-in-time  (in  this  beamsteering  application, 
decimation-in-element-space)  FFT. 


Alternatively,  let  N  =  LR  =  [  ■ 


'20‘ 
.0  1. 


with 


[?2/r?2]=  [z2/z2r]7’={[8  ],[£]} 
[?W]  =  [z2/z2l]t  =  {[§]...[ 


2K~i- 

0 


-1]}- 


The  FFT  of  ([43])  and  Fig.  12  then  has  an  input  stage  comprising  2  A  ~ 1  transforms  of  two  inputs  to  two 
outputs,  butterflies,  and  an  output  stage  that  is  a  pair  of  2A  _1  point  FFT’s.  The  twiddle  factors  are  as 
before.  Recursively  decomposing  the  output  FFT’s  into  smaller  FFT’s  until  nothing  but  butterflies 
remain  yields  the  classic  radix-2  decimation-in-frequency  (here  decimation-in-beam-space)  FFT. 


Special  case:  an  ordinary  ID,  mixed-radix  FFT.  Suppose  N  =  |"3x2A  ? 
LR=  [3  0]  [2*0]  with 


Factorization  N  = 


[p/Rp]=  [zy^Rp = {[§] . . .  p*-1]} 

[P/LP]=  [Z2/Z2L]t  ={[»].  [■],[§]} 


gives  the  FFT  of  ([43])  and  Fig.  [T2]specialized  to  the  structure  shown  on  the  left  in  Fig.  13  The  input 
stage  comprises  three  2K  point  FFT’s,  and  the  output  stage  uses  2K  transforms  of  three  inputs  to 
three  outputs,  each  of  which,  by  (|3T|),  takes  the  form  of  the  of  three-winged  dragonfly  (nonstandard 


name)  on  the  right  in  Fig.  13  The  twiddle  factor  here  becomes 


-j2-nrN  xl  _  g  j27rr  1/(3x  2  )  _  ^-]2irr\h/ (3x2A')+r2A  _  e-j27rrit?i/(3x2A ) 


The  three  input  FFT’s  can  be  structured  in  classic  radix-2  form  as  per  the  previous  discussion. 
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Figure  13: 
with 


Left:  This  12 -point  FFT,  in  effect  in  one  dimension,  was  created  from  (|43j) 
L=[30]  and  R  =  [  o  1  ] 


and  uses  three  four-point  FFT’s  followed  by  four  three-point  DFT’s.  In  terms  of  indices  r\  and  £\, 
twiddle  factors  e-j27rri^i/12  evaluate  as  shown.  Right:  the  three-point  “dragonfly”  DFT  used  four 
times  on  the  left. 


Alternatively,  let  N  =  LR  =  [  2A  5  ]  [  o  l  ]  wdh 


[P/R?2]=  [z7s2R]t={[§MoMo]} 

[F/Lf2]  =  [Z2/Z2L]T=  {[§]  ■  .  .  t2*-1]} 


The  FFT  of  (|43|)  and  Fig. 


12 


then  has  an  input  stage  comprising  2K  three-point  DFT’s,  each  a 
dragonfly  mapping  three  inputs  to  three  outputs,  and  an  output  stage  that  is  three  2h  point  FFT’s. 
The  twiddle  factors  are  as  before. 


Special  case:  the  ordinary  2D  FFT  using  row  and  column  FFT’s.  Suppose  N  =  ^  ]  = 

L*=[^?nu]- 

[?2/R?2]  =  [S2/^2R]  T=  {  [  8  ]  •  •  •  [  AT,0-!  ]  } 

[r/L?2]  =  [^2/^2L]r= {[8]  •  •  ■  [vu 

As  illustrated  in  Fig.  [14] for  Ni  =  3  and  A^  =  4,  this  yields  N\  one-dimensional  Appoint  FFT’s 
on  input  rows  followed  by  N2  one-dimensional  N) -point  FFT’s  to  produce  output  columns.  The 
twiddle  factors  e~j27rrN  1  in  the  center  are  equal  to  unity. 


4.3  Custom  FFT  design 


3.3 


system  design  (page  27 1  featured  N  =  LR  =  [ 


0  -2' 


??cols  0 
0  flrows  - 


Here  let  us 


The  Section 

develop  the  corresponding  FFT  structure. 

We  will  need  various  remainder  sets.  In  the  middle  of  Section  2.1  we  developed  example  re¬ 
mainder  sets  mod  N  geometrically  using  an  example  steering  density  matrix  N  that  here  is  exactly 
our  matrix  L,  so  (fTO])  from  that  discussion  now  yields 


{«3££}  =  {[8L[-S]} 


CtsL}  =  {[0  0],  [0  -!]}. 
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Figure  14:  The  structure  of  the  FFT  of  ([43])  when 

L=[oi]  and  R=  [or]' 

This  choice  of  L  and  R  yields  three  ID  FFT’s  on  rows  (the  image  is  transposed  relative  to  con¬ 
vention)  followed  by  four  ID  FFT’s  on  columns.  Swapping  L  and  R  would  instead  yield  four  ID 
FFT’s  on  columns  followed  by  three  ID  FFT’s  on  rows  and  using  unit  twiddle  factors  in  between. 


If  we  then  take  ncois  =  24  and  nmws  =  12  for  the  sake  of  specificity  so  that  R 
use  the  straightforward  graphical  approach  of  Fig.  15  to  arrive  at 

[?W]  =  CKZ)  =  {  [8].- 


_  r  24  0 1 

~  L  0  12 J> 


we  can 


[2o3L 


[Z2/S2R]  = 


f  mod-R 
l  rows 


[n]  [11]  }  =  {[m2]  :  =  0,...,23andm2  =  0,...,ll} 

}  =  {  [00]  ,...,[230], 


[0  11], . . .  ,[23  11]  }  =  {[ri  r2]:  n  =  0, . . .  ,23  and  r2  =  0,. . .  ,11} 


«££}  as  illustrated. 


The  special  FFT  is  sketched  in  Fig.  16  which  shows  |L|  =  2  FFT’s  of  matrix  size  R  followed 


by  |R|  =  24  x  12  =  288  FFT’s  of  matrix  size  L.  The  latter,  by  ([31]),  have  |L|  =  2  inputs  and 
|L|  =  2  outputs,  and  the  complex  exponentials  in  ([3Tj)  here  (with  L  replacing  N  there)  become  ±1 
and  make  these  matrix-size  L  DFT’s  into  standard  butterflies.  The  |  L I  =2  FFT’s  of  matrix  size  R 


are  each  further  decomposed,  in  the  spirit  of  Fig.  14  into  24  one-dimensional  12-point  column-wise 
FFT’s  followed  by  12  one-dimensional  24-point  row-wise  FFT’s. 


Twiddle  factors  e-j27r(rR  XNL  H) 
and,  from  them. 


are  calculated  using  L  1  =  |[_}  q]  and  R  1  =  53:  [02 


=  {[§].![!]} 

rR-1  G  ([ri  r,]i[l  0]  :  n  =  0,...,23andr2  =  0, 11) 

when  r  —  0  wlKn  1  =  [ 4] 
rR_1L_1^  G  {0}  U  {[ri  ^2  ] ^  [  1  0 ]  [1]  :  n  =  0, 


,  23  and  r2  =  0, 


■n} 
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Figure  15:  Above:  example  graphical  determina¬ 
tion  of  column  and  row  remainder  sets  mod  R  us¬ 
ing  0  and  (fT2l>  respectively  for  R  =  [  204  ] . 

Right:  an  example  graphical  determination  of 
{columns}  using  Q,  for  N=[_2»-“], 


=  {0}  IJ  | ;  n  rs  ]  [ 2 ]/48 :  n  =  0, . . . ,  23  and  r2  =  0, . . . ,  111. 

Using  (|39|)  for  the  first  equality,  the  output  index 
k  e  [S2/S2N]  =  |sR+  r:  s  <E  [%2/%2L],r  €  [Z2/Z2R]  } 

=  {s[204  i°2]  +  r:  s  e  {[ o  o],  [o  -1  ]},r  e  {[n  r2]:r1  =  0, ...  ,23  and  r2  =  0, . . .  ,11}  j 
=  js'-f  r:  s' 6  {[ o  o],  [o  -12]},  r  e  {[n  r2] :  ri  =  0, . . .  ,23  and  r2  =  0, . . . ,  11} | 

=  j[ai  0:2  ] :  a\  =  0, . . . ,  23  and  a2  =  0, . . . ,  23  j 


steps  straightforwardly  through  0, . . . ,  23  in  each  coordinate.  The  input  XLm+£,  however,  is  indexed 
by  a  vector  that,  due  to  the  influence  of  L  jumps  through  index  space  as  the  components  of  vector 
m  are  stepped.  Thus  the  Fig.  16  structure  can  be  seen  to  be  a  decimation-in-input-space  structure, 
or,  in  the  relevant  application  context  here,  a  decimation-in-element-space  structure. 


5  Concluding  Remarks 

Steering  a  planar  receive  array  in  many  beam  directions  simultaneously  is  most  efficiently  done 
using  an  array  with  element  centers  laid  out  on  a  lattice,  because  that  enables  many-beam  steering 
using  an  FFT  structure  to  process  the  element  outputs.  This  paper  is  intended  to  include  enough 
detail  and  example  material  to  equip  a  determined  system  designer  with  the  tools  needed  to  design 
such  a  system  without  the  artificial,  unnecessary,  and  in  the  end  expensive  constraints  often  included 
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"[23  O] 


[23  ll] 


l[23  12] 


[23  23] 


Figure  16:  Example  custom  decimation-in-element-space  beamsteering  FFT  for  the  system  design 
example  of  Fig.  [9j  The  optional  mod  N  operation  on  the  input  subscript  vector  isn’t  shown,  but  if 
desired  would  use  the  set  identified  in  Fig. [15]  Computation  is  shown  for  all  2x24x12  = 

576  distinct  beams,  but  outputs  corresponding  to  beams  not  in  the  steering  region  would  not  be 
computed  in  practice. 


in  such  designs  for  the  sake  of  simplicity.  This  release  from  such  constraints  is  of  critical  importance 
in  arrays  of  this  type,  which  generally  require  expensive  digital  receivers  behind  each  element.  This 
material  in  this  paper  is  at  the  heart  of  minimizing  the  number  of  those  receivers  and  of  the  amount 
of  hardware  required  for  subsequent  beamsteering. 

This  paper  is  not  intended,  however,  as  a  Treatise  From  God  detailing  step-by-step  what  the 
designer  must  do  and  making  the  designer  more  or  less  unnecessary  in  the  process.  Instead,  consid¬ 
erable  room  is  left  for  creative  use  of  the  body  of  concepts  and  approaches  developed  and  illustrated 
here.  Further,  the  reader  is  not  asked  to  take  the  author’s  word  on  theoretical  foundations  but  is 
instead  presented  with  the  mathematical  theory  in  enough  detail  to  permit  verification  of  its  correct¬ 
ness  (or  indeed  to  make  corrections  if  needed).  A  signal-processing  perspective  governs  throughout, 
rather  than  the  more  traditional  electromagnetic  one.  Antenna-community  readers  might  find  this  an 
irritating  inconvenience,  but  it  serves  the  honorable  purpose  of  greatly  simplifying  the  development. 
As  unsimple  as  the  paper  surely  does  seem  in  the  end,  it  would  have  been  far  worse  if  attempted 
without  the  basis  matrices,  remainder  sets,  4D  Fourier  transforms,  etc.  used  so  freely  throughout. 

Covered  in  detail  are  the  intimately  related  topics  of  the  specific  FFT  design,  the  steering  grid  of 
beam  directions,  the  grating-lobe  locations  in  beamspace,  and  the  element-location  lattice  chosen. 
The  FFT  design  is  based  on  a  matrix  and  its  factorization  that  together  correspond  to  a  choice  of  a 
sublattice  of  a  sublattice  of  a  lattice,  and  the  FFT  mathematics  is  then  based  in  a  deep  way  on  coset 
decompositions  in  this  chain  of  nested  sublattices.  To  make  this  development  self-contained  and 
enabling  of  a  general  design  approach,  the  associated  coset  mathematics  had  to  be  included  here, 
but  in  order  that  the  reader  not  need  a  background  in  group  theory,  that  material  was  developed  here 
in  a  self-contained  way  and  specialized  to  the  case  at  hand.  The  cosets  were  not  dealt  with  in  their 
usual  generality,  but  instead  the  entire  topic  became  a  matter  of  remaindered  division  of  an  integer 


38 


vector  by  an  integer  matrix.  While  that  material  will  prove  the  most  challenging  part  of  the  paper 
for  the  typical  reader,  considerable  effort  went  into  attempting  to  making  the  material  accessible  to 
an  MSEE  engineer  or  to  an  experienced  BSEE  engineer  with  a  nonphobic  attitude  to  mathematics. 

It  is  natural  to  be  reading  this  at  first  on  a  look-ahead  basis,  but  this  author  does  hope  that  some 
readers  have  sufficiently  robust  reserves  of  engineering  fortitude  to  be  tempted  to  go  back  and  work 
through  the  material  of  this  paper  carefully.  Only  thus  can  the  substantial  benefits  of  the  basis-matrix 
notational  framework  to  careful  thinking  about  arrays  be  fully  appreciated.  It  is  in  part  towards  that 
end  that  so  many  array  basics  are  developed  in  detail  in  this  paper. 
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